pSPRH830b E. coli-Micromonospora shuttle vector




pSPRH830b - pSPRH826b backbone 



Function                         Source

- Ampicillin resistance          (pUC18)
- Multiple cloning site          (pUC18)
- pUC18 origin                   (pUC18)
- Hygromycin resistance          (p16R1)
- oriT (origin of transfer)      (pRL1058)
- pIJ702 origin of replication   (pIJ702)
pSPRH840 integrating vector

Figure 7A




pSPRH840 - pSPRH826b backbone; pMLP1 xis, int, attP insert

pSPRH840 conjugated from E. coli into    HmR transformants obtained

M. carbonacea                            +
M. rosaria
M. halophytica                           +
[Annotated nucleotide sequence with amino acid translations, in numbered rows starting at position 233; the sequence text is illegible in the source scan.]



Figure 7B 
1 GGTACCCGACCGTGTCCCGGAACAACGAGTCGAGATACGGCGAGAGGAACACCCCCCGGTAGTCCGGGTAGACGGTGGGCGCGAAGGCGTAC
93 GCGCCTTCGACGGTCAGCGGGCGGGCGGACACCGGCGGGTCAGCTCGTCACGTGTACGCGGGGGACGTACAGGATCCACTGTCCGCCAGCC 

<• STVHVRPVYLIWQGGA 
184 CGGCGGAACTCCTGCTCCTTCGCCATGATCTCGTCGGCGTGGTTCCAGGCGAAGAGCAGCGCGTAGTCCACCGCGTCGGGCGTGAACGCGTC 

<R RFEQEKAMI EDAHNWAFLLAYDVADPTFAD 
276 CGGGGTGCGCACCGGGATGTGCGTGCCGGGGGTGAGCCGGCCCTGCTTGGCCGGCGTCGTGTCGCACACCCAGGAGACCAGGTCCGGACCGA
<PTRVPI HTG PTLRGQKAPTTDCVWSVLDPG I 
368 TGCCGCAGAAGTTCGTCACGGTGGCGCTCTTCGCCGTCGCGCCGTACGCCACCACCCGCTTGCCCTCGGCCTTGAGCGAGTTCAGCAGGGCG

< GCFNTVTAS KATAGYAVVRKGEAKLSNLLA 
460 AGCAGGTCGGTGCGGATGCCCTCGACGTCGGCGGCGAACCTGTCGAGCCGGCTGCGGTCGGCGACCCCTCGGGCGTCCTCCTCGCCGATCAG

<L LDTRIGEVDAAFRDLRSRDAVGRADEEGI L 
552 CGCGGCCACCCGCTCGGCCGGCTGCCGCGCGCCGGCACGGGCGATGGTGTAGCGGACCTCCCCACCGTGCACCGGAAGGCGTTCCACGTCGA 

<AAVREAP QRAGARAI TYRVEGGHVPLREVDV 
644 CGAGCGCGAAGCCGAAGCGGGCCGCCAGCGCCTGCACCGACCGCGCCGAGAAGAAGAAGAAATGCTCGTCGTAGATCTGGTCGAAGGACGTC

< LAFGFRAALAQVSRASFFF FHEDYIQDFST 
736 TTGTCCAGGATGTCCCCGAGGTACGGGTCCTCGAAGACGAAGACCCCGTCCGGTGCCAGCAGCGCGTCAACGCCCCGGAGGATGGAGTCGAG

<KDLIDGLYPDEFVFVGDPALLADVGRLISDL
828 GTAGGGGATGTGGCAGATGGTGTTGGCCGCGAAGATCACATCGGCGGGGCCGTCGGTCTCCCTGACCCGGCGGGCGGTGGACTCCTCGAAGA

<YPIHCITNAAFIVDAPGDTERVRRATSEEFF 
920 ACTCGGTGACCACCCGCACCCCGTGCCCACGGGCCACATCGGCCACCCCACCCGACGGCTCGAAGCCCAGGTGCCGCACTCCCGCCTCGTGG

< ETVVRVGHGRAVDAVGGSPEFGLHRVGAEH 
1012 ACGGTACGCAGCATCACCCCGTCGTTGCAGCCGATCTCCACCACGAACGGGTCCGGGCCGGTGGCCTCGTGCTCCAGCAGGTGCCGCGCGGT 

<V TRLMVGDNCGI EVVFPDPGTAEHELLHRAT 
1104 GTCGGCGAAGTGCTTCTGCATCACGGAGGAGCCCGACGAGTGGTACGGGTAGTCCTGGTGGAACATCTCCTCCCGGGGCACCTCCTCCATGA 

<DAFHKQMVSSGSSHYPYDQHFMEERPVEEML 
1196 GCTGCACCATCGTGCAGCCCGCGCAGACCCCCACCGCCAGGTGGAAGAAGAACTCGTCCGCGAGCTGGTCCTCGGTGAGGAACCGGTCGGAG 

< QVMTCGACVGVALHFFFEDALQDETLFRDS 

1288 AGCGGCTGCCGACCCAGGTCGAGGAACTGGGCGGTGGTGCCGCCACACGCCCGGCAGGTGCCGCGCCGGGGCGTGCCGTCGGCTGAATCGGT

<L PQRGLDLFQATTGGCARCTGRRPTGDASDT 

1380 CATCGCATTCCTTCCATGGATACCCCTGCCTCAGGCAGGGCGGATGTCAACGACGTCTCCTTGTCGGGATGGGTCGTCCAGCTCTCGGC

< M 

1469 TGCGGCGTGCCGGCCGGCGACTCAGAGCGCCGCGAGCACGTCGCGCAGCGCGTCGATCACCCGGTCCTGCGTCTCCGGCGGCAGCGAGGGA

<• LAALVDRLADI VRDQTEPPLS P 
1560 TACATGGGGAGGGAGAAGATCTCGCCGGCCAACCGTTCCGTGGTGGGGAGGGAGCCGGTCTGGTAGCCGAGCTTGGCGAAGCCGGTCATGGT 

<Y MPLSFI EGALRETTPLSGTQYGLKAFGTMT 
1652 GTGCACCGGCCAGGGGTAGCTGATGTTGAGCGCGATGTCCCGCGCCGCCAACTGTTCGAGGATGCGGTCGCGGGCGGGGTGGCGGACCACGT 

<HVPWPYSINLAIDRAALQELIRDRAPHRVVY 
1744 AGACGTACCAGACGTGTTCGTTGTCGGTCGCCGTCACCGGCAGGGTCAGGCCGACCTGGTCGGCGATGTCGGCCAGGCCCTCCTCGTAGCGC 

< VYWVHENDTATVPLTLGVQDA I DALG EEYR 
1836 CGGGCCACGGCGCGGCGCCCGGCGATGTAGTCGTCGAGCCGGCACAGCTTGCGGCGCAGGATCTCGGCCTGCACCTCGTCGAGGCGGCAGTT

<R AVARRGAIYDDLRCLKRRLI EAQVEDLRCN 
1928 GTGCCCCGGCGTCTCGACGACGTAGTAGACCTGTTCCATGCCGTAGTAGCGCAGCCGCCGCAGCCGCTCGTCGATCACGGCGTCCGCGGTGA

<HG PTEVVYYVQEMGYYRLRRLRED IVADATV 
2020 CCACGCCGCCGCCGTCGCCGTACGCGCCCAGCACCTTCGTCGGATAGAACGAGAAGGCAGCGGCGTCGCCCATCGTGCCGGCCAGCCGTCCG

<VGGGDGYAGLVKTPYFSFAAADGMTGALRG 
2112 CCCCGACGCGCGCCGTGCGACTGCGCGCAGTCCTCCAGGACCTTGAGGCCGTGTTCCCGGGCCACCCGCAGCACCGGGTCCATGTCGACGCA 

<G RRAGHS QACDELVKLGHERAVRLVPD MDVC 
2204 CTGGCCGTAGAGGTGCACCGGCAGCAGCGCCTTCGTCCGGGGGGTGACCGCCTCGGCCAGCAGCTCGGTGTCCATCAGGTAGTCGTCGGCGC

<QGYLHVPLLAKTRPTVAEALLETDMLYDDAR

2296 GGACGTCCACGAAGACCGGCGTCGCGCCGACCGCGTCGATGGCGAGCACCGTCGGCGCGGCCGTGTTGGAGACGGTGATGACCTCGTCGCCC

<VD VFVPTAGVADIALVTPAATNSVTIVEDG 

2388 GGCCCGACGTCGAGCGCCTGGAGTGCGAGCTTGATGGCGTTGGTGCCGTTGTCCACCGTGACGCAGTGCGGCATGTCGTGATAGGCGGCGAA

<PGVDLAQLALKIANTGNDVTVCHPMDHYAAF 

2480 CTCCTGCTCGAAGCCGCGCACGCTCGCGCCGAGGATGAGGTTCCCGGACTCGAAGACCGTCTGCACGGCGTCGAGGAGGTCGTCCCGTTCCT

<EQEFGRVSAGLILNGSEFVTQVADLLDDREK
2572 TCTCGTACTCCGGCAGGTAGCCCCACACTCGGATGGTCATCTTCGCCCTTCTACGCCGAGGTCCGGGAGCGGAGCGCGCGCACGCTCAT 

< EYEPLYGWVRITM <»ASTRSRLARVSM 
2661 GTAGTCGTTGTCCCGGTCGAGTCCAAGTGCCTGACCGCTCAGGTAGTCGACGGCGTCCACGTAGCTGTAGGGCTGCATGAACCCGCCGGCCC 

<YDNDRDLGLAQGSLYDVADVYSYPQMFGGAR 
2753 GCACGTCGCGATAGAGCCGGGAGAGTGGGTGCCCGGCCGTGTAGGCGAGGCCGCCGACCAGGCTGAGGCAGTCGTCCACCACCGCCGGGGCC 

<VDRYLRSLPHGATYALGGVLSLCDDVVAPA 
2845 AGCTCGTTGACGGTCATCTTCGCGTACTGGAACGGGGTCATCATCCGTCGGCCGCGTTCGTCCGGGTCGCCGGACAGGTCGACCGACGCCGC

<L ENVTMKAYQFPTMMRRGRED PDGSLDVSAA 
2937 GTCGGCGTTGGTCAACGCCGCGCCGACCGTGGTGCGTAGCGCGTAGAGCCGGGTGTCCAGCCCGGCGACCAGCGCCCGGGCACCGGCCCGTG

<DANTLAAGVTTRLAYLRTD LGAVLARAGAR P 
3029 GCTCGCCGCCGCGCCCCGCGCAGAAACCGACCGCGATGTCCCGGGCCGCCTGGGCGATGCCGGCGTAGATGCCGAGCATGGTGATCGAGCTG

< EGGRGACFGVAIDRAAQAI GAYIGLMTI SS 
3121 ACCGTCTGCCCGGCCAGCACGGCGTCCCGCCGGGCCCCGACCGGGCCGCGCTCCAGCAGCTCGTCGGCCCGGACCGGGCACCGGTCGAAGAC 

<V TQGALVADRRAGVPGRELLEDARVPCRDFV 
3213 CACCTCCAGCGTCCCCGAGGCACGCATTCCCAGGCCGTCCCAGTTGTCCAGCACCGTGAGCCCGGGGGCGTCGCGGTGCACGACCGGCACGG 

<VE LTGSARMG LGDWNDL.VT LG PADRHVV PVA 
3305 CGAGGAACACCGAGCCGTCGTCGTCGCGCCGCTGGGCGTGCACGAAGAAGTGGGTCGCGATGGGCGCCATGCTGACCAGCACCTTGCGGCCC 

< LFVSGDDDRRQAHVFFHTA I PAMSVLVKRG 

3397 GACAGCAGCCAGCCGCCGGCGCCGTCGGAATGCAGCTCGGTGACCACGCCGGGGGCGTCCTTGAGCGCGCCGCAGACGGCGGCCTCGCCCTC

<S LLWGGAGDSHLETVVGPADKLAGCVAAEGE 

3489 CGCCATCGCCCGCAGCAGCCGCTCCGCCATCGCGCGCACCGGCGGCGTGCCGTGCTGCCACTCGTAGGTGAGGGTGAGGCCCCGGCTGAGCT

<AMAR LLREAMARVPPTGHQWEYTLTLGRSLQ 
3581 GCACGTGCCAGGCCAGCGCGGTGGACGCGTCGGCCTCGGCCAGCCGCATCAGCGCGGTCGCCACGTCGTACAGCCGGGTCAACCCCAGGCCG 
<VHWALATSADAEALRMLATAVDYLRTLGL G 

Figure 11A
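
In the listing above, rows prefixed with "<" give the translation of the complementary strand, with each residue printed beneath the top-strand triplet it is read from (so the protein reads right to left). The following is a minimal Python sketch of that layout, assuming the standard genetic code; the function names and the `offset` parameter are illustrative assumptions and are not part of the figure.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip(
    (a + b + c for a in BASES for b in BASES for c in BASES), AMINO))

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate_minus_strand_as_displayed(top_strand, offset=0):
    """Translate the complementary strand, keeping top-strand order.

    Each top-strand triplet (starting at `offset`, a hypothetical
    parameter for the reading-frame shift) is reverse-complemented and
    translated; residues are returned left to right as they appear under
    the top strand, i.e. C-terminus first.
    """
    top = top_strand.upper()
    residues = []
    for i in range(offset, len(top) - 2, 3):
        residues.append(CODON_TABLE[reverse_complement(top[i:i + 3])])
    return "".join(residues)


# First 38 nucleotides of the row numbered 184 in the listing.
row_184_start = "CGGCGGAACTCCTGCTCCTTCGCCATGATCTCGTCGGC"
print(translate_minus_strand_as_displayed(row_184_start, offset=2))
```

With `offset=2` the printed residues line up with the translation row shown under position 184.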
3673 CCCAGCTCGGCGGGAACGGTGGCGCCCATCAACCCGAGCTTCGCGAACTGCTCGAACGCCTCCACCGGGAAGGTGCCGGTGCGGTCCCGGTC
<G LEAPVTAGMLGLKAFQEFAEVPFTGTRDRD 

3765 GGCGGCCTCCGCACTGATTCTCGGGATCACGCCGGCAAGCAGGTCGACCACCGTGCGCCCCGCCGGCGTGAGCGGCGCGCGAAGATCCGCCG
<AAEAS I R P I VGALLDVVTRGA PT LPARLDAA 

3857 CCACCCATCTCTCCTCGGTCAGATTAGACATCGCCTGCTTCCGTTCGCGCTGTGCCGAACCTGTCGCTATCAGGGTGCGCGGCGATCACC
< V 

3947 AATTGCTGGCTGATTGTCCACCGACGATGCTCGACAGGGATACCCAGAATAGGCGGCAACGGCTTGGCGAAACCCCTGTCGTGCGCCGGAGT

4039 TCGCCAAGTCTAGTTGGATCACTCGAGCTTCCCCCGCAGAAGCCGTGAACCATGGGCCAGCGGTTGACGTCGCTATATATTCGGCCGACACG
4131 TGCGGAGGACTCGTGAAGATACTGTTCATCGCAGGACCGACGAAGTCCAGCCTATTCGGCCTGGCCCCACTGGCAATCGCCGCCCGGATG 

>VKI LFIAGPTKSSLFGLAPLAIAARM 
4221 AGCGGGCACGAGGTCGTGATGGCTTCCACGCAGGAGGTCGTACCGGCGACGATGTCCGTCGGGCTGCCGGCCTTCCCGCTGGCGGCGCTGAC

>SGH E VVMAS TQEVVPATMS VG L PAF PLAALT 
4313 CCTCGCCGAGCTCATGACCACCGACCGGGCCGGCGATCCGCTGCGCATCCCGGCCGAGGACGCCGCCTTCGTCCCCTTCGTCGGCCGGATGT

>LAELMTTDRAGDPLRI PAEDAAFVPFVGRM 
4405 TCGGCCGGCTGGCGGCGATCAGCCTGGATCCGCTGCGCGACCTGGTCGGCGGGTGGCGGCCCGACCTGATCGTCGGCGGCCCGCACGCCTAC

>F GRLAAI S LD PLRDLVGGWRPDLIVGGPHAY 
4497 GCCGCGCCGATCCTGGCCACCGAACTTGGGGTGCCCTGCGTGCGGCACCTGCTCACCGGCAACCCGGTGGACCGCGAGGGCACCCATCCGGG

>AAP I LAT E LGVPCVRHLLTGN PVDREGTH PG 
4589 GGTCGACGAGGAGCTGCGGCCGGAGCTGGCCGCGCTCGGCCTGGCCCAGGTGCCGCCGTTCCACCTGGCCCTGGACATCTTCCCGGCCAGCA

>VDEELRPELAALGL AQ VPPFHLAL DIFPAS 
4681 CCCGGATCGACGACGTCCCGCCGGCGCAGCCGGTGCGACCGCTGCGCTGGATTCCGACCAACCAGCAGCAGCCGGTGGCGCCGTGGATGCTC

>TRIDDVPPAQPVRPLRWIPTNQQQPVAPWML 
4773 TCGCGCGGGCCGCGTCGCCGTGTCCTGGTCACCGCCGGCAGTCTGGTCACCACCACCCACAACTTCGACTTCCTCCACGGACTGGCCGGCAC

>SRG P RRRVLVTAGSLV TTTHNFD FLHG LAGT 
4865 CCTGGCCGAGCAGGACGTCGAGGTCGTGGTCGCGGCGCCGCCGGAGGTGGGTCGGGCCCTGCACGACGTGCCGGGTGTGCGGCACGCCGGCT

>LAEQDVEVVVAAPPEVGRALHDVPG VRHAG 

4957 GGCTCCCGCTGGACGTGGTGCTGCCCCACTGTGACCTGATCGTGCACCACTCCGGCACGATGACCGCGCTGACCGCCTTGAACGCGGGGGTG

>W LPLDVVLPHCDLIVHHSGTMTALTALNAGV 

5049 CCCCAGCTGATCGTGCCGCAGGAGAGCCGGTTCATCGAGTGGGCGCGCAACCTGTCGACCCTGGGCGTGGCGCAGACCCTCGCGCCGGGCGA

> P Q L I VPQES RFI EWARNLS TLGVAQTLAPGE 
5141 GGACACGCCGGAGGCCGTGGGCAAGGTCGCCCGCCTGCTGCTGGAGGATCCGGTCCACGCCACCAGCGCCGCCGCGATCGCCCGGGAGATCG 

>DTPEAVGKVARLLLEDPVHATSAAAIAREI 
5233 CCGAGATGCCCGGCCCCACGGAGGTCGTGGGCCAGCTCACCGAGTTCGCGACCCGGGGCCTGACATGCGCGTCCTCGTGACCGGCGGAGCC

>A EMPGPTEVVGQLTEFATRG L T C A S S • 

> V T G G A 

5324 GGGTTCATCGGCTCCCACCTCACCGACGCGCTGCTCGAACGCGGCGACAGCGTCACCGTGCTCGACGACCTGTCCACCGGGCGGCCCGAGCG

>GF I GSHLTDALLERGDSVTVLDDLSTGRPER 
5416 GCTGCCCGCCGGGGTGCCGCTGCACCACGGGTCGATCACCGACCGGGCCGGGTTGACCCGGCTGGCCGAGCAGTGTCGCCCGGAGGTCATCT 

>LPAGVPLHHGSITDRAGLTRLAEQCRP E V I 
5508 GCCACCTGGCCGCCCAGGCGGACGTGCGCAACTCGGTGGCCGACGCCACCTCGGACACCGGGGTCAACGTGGTCGGCACCGTCAACGTCCTG

>C HLAAQADVRNSVADATSDTGVNVVGTVNVL 
5600 GAGGCCGCCCGGGCCATCGACGCCCGGGTGGTCTTCGCCTCCAGCGGCGGCGCCCTCTACGGGGAGGTCGACGAGCTGCCCTCCCCCGAGGA

>EAARAIDARVVFASSGGALYGEVDELPSPED 
5692 CGTCCGGCCGGCGCCGTGGGCGCCGTACGGGGCCGCCAAGTACTGCGCGGAGCAGTACCTGGCGCTCTACAACCGGCTCTACGGCTCGACCC

>VRPAPWAPYGAAKYCAEQYLAL YNRLY GST 
5784 ACGCGGCGCTGCGGCTCGGCAACGTGTACGGGCCACGCCAGGACCCGACCGGCGAGGCCGGGGTCGTCTCGATCTTCTGCGGCTGCCTGGTG 

>H AALRLGNVYGPRQDPTGEAGVVSI FCGCLV 
5876 GCCGGGCGCCGGCCGACGGTGTTCGGCGACGGCGAGCAGACCCGGGACTAGATCTACGTGGCCGACGTGGTGGAGGCGTTCCTGCTCGCGGT

>AGRRPTVFGDGEQTRDYIYVADVVEAFLLAV 

5968 CGGGCACGGTGGCCCCGGCCTGTGGAACATCGGCACCGGGACCTCCACCAGCATCCGCAAACTACTGGACCTGGTCGGCCGCACCGCCGGGC

>GHGG PGLWN IGTGTSTS I RKLL DLVGRTAG 

6060 GCGTCCCGGACCCCCGCTTCGAGCCACCCCGCCTGGGCGAGCTGAAGCACTCCGCGCTGGAGGTGACCCGCGCGGCCCGGGAGCTGCGCTGG

>R VPD PRFE P PRLGELKHSALEVTRAARELRW 
6152 GCGGCCCGAACGAGGCTCGCCGACGGCATCGCGAAGGTCTACAAGTGGGTCGAGGCGGACGAACCGGTCCGGGGGGAGCGATGACCCGCG 

> M T R 

>AARTRLADG IAKVYKWVEADE P V R G E R • 
6242 AGGGGTCAACGCCGCCGGTTAGGGTCGCCACCATCACGGTCGGCACCAACGAGATCCGTTGGCTGGACCGCGCGCTCGGCTCGCTGCTCGCC

>E GST PPVRVATITVGTNEI RWLDRALGSLLA 
6334 AGCGACACGACCGGCTTCGAGCTGACGGTCTTCTACGTGGACAACGCCTCGGCCGACGGCAGCGTGGCGCACGTCATGTCGGCGTTTCCCGG 

>SDTTGFELTVFYVDNASADGSVAHVMSAFPG 
6426 CGTCCGGGTCATCCGAAACCCCCGCAATCTCGGCTTCACCGGCGCGAACAACGTCGGCATGCGGGCGGCCCTGGCGGAGGGCTTCGACCACA

>VRVI RNPRNLGFTGANNVGMRAALAEGFDH 
6518 TCTTCCTGGTCAACCCGGACACCTGGACACCGCCGGGGCTGGTCCGCGGGCTGGTCGAGTTCGCGCAGCGGTGGCCGCAGTACGGCGTCATC

>I FLVNPDTWTPPGLVRGLVE FAQRWPQYGVI 
6610 GGCCCGTTGCAGTACCGCTACGACCCGGCGTCGACCGAGTTGACCGACTTCAACGACTGGACGCAGGTCGCCCTCTACCTGGGCGAGCAGCA

>GPLQYRYDPASTELTDFNDWTQVALYLGEQH 
6702 CACCTTCGCCGGCGACCTGCTGGATCATCCCTCGCACGTCACCGCGACGGTCCGCGACCGCGCGCCGCGCACCCTGGAGCACGCGTACGTGC 

>TFAGDLLDHPSHVTATVRDRAPRTLEHAYV 
6794 AGGGCTCGGCGCTGTTCGTCCGGGCCGCCGTGCTACGCGAGGTCGGCCTGCTCGACGAGGTGTTCCACACCTACTACGAGGAGGTCGACCTG 

>Q GSALFVRAAVLREVGLLDEVFHTYYEEVDL 
6886 TGCCGGCGGGCCCGGTGGGCGGGCTGGCGGGTGGCGCTCCTACTCGACCTCGGCATCCAGCACAAAGGCGGCGGTGGCACCGCCGCGAGCGC

>CRRARWAGWRVALLLDLG I QH KGGGGTAASA 
6978 GTACAGCCGGATACACATGCGCCGCAACCGCTACTACTATCTGCTGACCGATGTGGACTGGCCCCCGGCCAAGGCCGCCCGGCTCGCCGCCC

> YSRIHMRRNRYYYLLTDVDWPPAKAARLAA
7070 GCTGGCTGTTCTCCGACGTCCGTGGGCGGGGCGTGACGGGTCGGACGAGCGCGGGCGTCGGGGCGCGGGAGACCTTCGTGGCGCTCGGGTGG

>R WLFSDVRGRGVTGRTSAGVGARETFVALGW 
7162 CTGGCCCGCCAGGCCCCGGTGATCCGGGAACGTCGTCGGCGGCACCGGCTGCTGCGGGCACGAGGGACGGGCGTGGACCGCGCCCGAGAGCG

>LARQAPV I R ERRRRHRLLRARGTGVDRARER 



Figure 11B 
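
Each sequence row in these figures starts with the 1-based position of its first nucleotide, so the number on any row should equal the previous row's number plus the previous row's length. The sketch below shows how that relation can be used to spot a misread position number in a transcribed listing. It is only an illustration: the function names and the short demo rows are hypothetical and are not taken from the figures.

```python
import re

# A row is a position number (digits possibly split by stray spaces)
# followed by nucleotides (possibly split by stray spaces).
ROW = re.compile(r"^\s*((?:\d\s*)+)([ACGTacgt\s]+)$")


def parse_row(line):
    """Return (position, sequence) for a sequence row, else None."""
    match = ROW.match(line)
    if match is None:
        return None  # translation rows, captions, blank lines, etc.
    position = int(re.sub(r"\s", "", match.group(1)))
    sequence = re.sub(r"\s", "", match.group(2)).upper()
    return position, sequence


def numbering_mismatches(rows):
    """List (stated, expected) pairs where a row number breaks the chain."""
    mismatches = []
    for (pos, seq), (next_pos, _) in zip(rows, rows[1:]):
        expected = pos + len(seq)
        if next_pos != expected:
            mismatches.append((next_pos, expected))
    return mismatches


# Hypothetical demo rows: the third number should be 11, not 12.
demo_lines = ["1 ACGTAC", "7 GG TT", ">MT", "1 2 CC"]
demo_rows = [r for r in map(parse_row, demo_lines) if r is not None]
print(numbering_mismatches(demo_rows))
```

A mismatch only says that a number and a row length disagree; deciding whether the number or the sequence was misread still needs the original drawing.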






7254 GAAGGAAACCGTGCGGGGATGAGCAGGCCACGGATTCTCGTCGCGGGCAACTTCCACTGGCAGGCCGGGTTCAGCCAGACCGTCGCCGCGT
>KETVRG•

>msrprilvagnfhwqagfsqtvaa 

7345 acgtgcgggcggcccgggaggccgactgcgaggtgcggctctgcggcccgctgtcccgggtcgacgccgagacggcccggcacctgccggtc

>y vraareadcevrlcgpls rvdaetarhlpv 

7437 gagccggacctccgctggggcacccacctggtgatcatgttcgaggccaagcagttcctcaccgaggcgcaactggacctcgtcgaggcgtt

>e pd lrwgth lvi m feakqf lt eaqldlveaf 

7529 cccccgacagcgccgggccatcgtcgacttcgacgggcactggggtgccgaggagggcggggacggcgacagcgcgtcgggccggtactccg

> prqrrai vdfdghwgaeeggdgdsasgrys 

7621 cggagagttggcggcggttgtactcgaccctgagcgacctgatcctgcaaccccggctgggtccgctcccggccggcgcccggttcttcaag

>a eswrrlystlsdlilqprlgplpagarffk 
7713 tgcttcggcctggcagcgccggtgcggcacccgctggaactgggcaccggcgcgcagtcgcgcccgtacgacctccagtacatcggcagcaa 

>cfglaapvrhplelgtgaqsrpydlqyigsn 
7805 ctggtggcggtgggagccgatgaccgagatggtcgaggccgcggcggcggcccggccgccgctgcgccggctgcgggtgtgcggacgctggt

>wwrwepmtemveaaaaarpplrrlrvcgrw 
7897 gggacggcggcagttgcgcgggcttcgaggaggcgacgctcagcgagccgggctggctgcgggcgcgcggcgtcgaggtgcatccgcccgtg

>wdggs cag f e eatls epgw lrargvevh p pv 

7989 ccgttcggccacgtggtcgagcagatgggccggtcgctgatctcaccggtcctggtgcggccgctggtcaccagcaccggcctgttgacccc

>pfghvveqmgrslispvlvrplvtstglltp 
8081 ccggatgttcgagacgctggcctcgggcagcctgcccgtgctcccggtcgccgcgaagttcctcgcgccggtctacggcgacgaggcggaac

>rmfetlasgslpvlpvaakflapvygdeae 
8173 acctgatgctcggcgacgacccggccggaacgctgagccggctctcggccgagcacgaacggtacggacgactggtcggtgagattcaggac

>hlmlgddpagtlsrlsaeherygrlvgeiqd

8265 cggctccgcgtcgagtacggctaccctcgcgtcctgcgggacctgctcgatctgctggcctgaggaatgaggagcagatgacccccctg

> M T P L 

>RLRVEYGYPRVLRDLLDLLA•

8354 CGGATCGCGATGGTCAACATACCGTTCCGGTTGCCGAGCGACGAGCGGCAGTGGATCACGGTCCCGCCGCAGGGGTACGGCGGGATCCAGTG

>RIAMVNI PFRLPSDERQWITVPPQGYGGI QW 

8446 GATCGTGGCCAACAAGATCAAGGGCCTGCTCGAACTCGGGCACGAGGTGTTCCTGCTCGGTGCCCCGGGCAGTCCGCGTACGCATCCACGCC

>IVANKIKGLLELGHEVFLLGAPGSPRTHPR 

8538 TGACCGTGGTGCCGGCGGGCGAGCCCGAGGACATCCGGGCATGGTTGAAGTCCGCTCCGGTGGACGTCGTCAACGACTACAGCTGCGGCAAG

>L TVVPAGEPEDIRAWLKSAPVDVVNDYSCGK 
8630 GTGGATCCGATCGAGCTGCCCCCGGGGGTCGGCCTGGTGGCCTCGCACCACATGACCACCCGCCCGTCCTATCCGGCCGGCTGCGTGTACGC

>VDP I ELP PGVGLVASHHMTT R PSYPAGCVYA 
8722 CTCGAAGGCGCAGCGGGAGCAGTGCGGCGGCGGCGCGGACGCCCCGGTCATCCCGATCGGGGTGGATCCGTCGCTCTACCGCCCCGGCGACC

>SKAQREQCGGGADAPVIPIGVDPSLYRPGD
8814 GCAAGGACGACTTCCTGCTCTTCATGGGCCGGATCTCCCCGTTCAAGGGCGCGCTGGAGGCGGCCGCGTTCGCCCGGGCCGCCGGCCGCCGG 

>RKDDFLLFMGRISPFKGALEAAAFARAAGRR
8906 CTACTGATGGCCGGTCCGGCCTGGGAGCCGGAGTACCTCGACCGGATCATGGGCGAGTACGGCGACCACGTCACCCTCGTCGGCGAGGTGGG 

>LLMAGPAWEPEYLDRIMGEYGDHVTLVGEVG 

8998 GGGTCAGGAACGTATGGACCTGCTCGCCACGGCGGCTGCCATCCTGGTGCTCTCCCAGCCGGTGCCCGGCCCGTGGGGCGGCACGTGGTGCG

>GQERMDLLATAAAI LVLSQ PVPGPWGGTWC 
9090 AGCCGGGTGCGACCGTGGTGTCCGAGGCGGCGGCCAGCGGCACCCCGGTGGTCGGCACGAGCAACGGCTGCCTGGCGGAGATCGTGCCGGCC

>E PGATVVSEAAASGTPVVGTSNGCLAEIVPA 
9182 GTCGGCGAGGTGGTGGGCTTCGGCACCGGCTTCGACGAGCGGGAGGCCCGAGCGGTGCTGTCCCGACTGCCGTCGCCCGCCCAGGCGCGGAA

>VGEVVGFG TG FDEREARAVL S RLPS PAQARK 

9274 GGCCGCGATCCGGTGCTGGGGGCACGTGGAGATCGCCCGGCGCTACGAGGCGGTGTACCGCGACGTGCTGGCCGGCGCGCGCTGGTCCTGA

>AAIRCWGHVEIARRYEAVYRDVLAGARWS•

9365 GCCGGCCCGGGCGGCGGCTACGGTCGCGACCGTAGGGGGTGCCCGCCGCACGCGGAAGCGCCGGTGTCGGCGGTCCGACACCGGCGCGCCCG

9457 GCCAGGTCAGCTCCGGTCGTGCAGCCAGGGTCGCGGCTCGGGGTCGGTCACCTCGACCGGCTGGCTCATGAACACCAGGACGTACGCGCGG

<o SRDHLWPRPEPDTVEVPQSMFVLVYAR 
9548 CGCGGCTGGTCCGTCTCGTTCGGGCCGGCGTAGTGCGGCGCACGGAAGTCGTGCATGACCGCCTCGCCCGGCCGCAGCGGGCAGGCGACGCT

<R PQDTENPGAYHPARFDHMVAEGPRLiPCAVS 
9640 GTCCGTGTCGTCGACCTCGTCCGTCATCAGACCTTCGATGCGGTCGTCGTGGTTGATGTGATGGTGCGGGAGCACCCCGCCCCGGTGCAGGC

<DTDDVEDTMLGE I RDDHNIHHI^PLVGGRHLG 
9732 CCGGCAGGTACTGGAGACAGCCGCTGGACACCGTCGCCTCGTCGAGCGGGGTCCAGATGCTCAGGCCGCGACGGGACCAGCGCGGGTCCATG

< PLYQL.CGS SVTAEDLPTWISLGRRSWRPDM 
9824 TACGCCTCGTCCTGGTGCCACGGAGTGGGTGCGCCGTAGCGCGGCGGCTTGAGGATCGCGTGGCCGTAGAAGTCGAGTTCGTCCTCGGGGAT

<Y AEDQHWPT PAGYRPPKLIAHGYFDLiEDEPI 

9916 GTCGAGGAAAGCGGAGGCGATGGCCCGGCATCGCGCGAAGTGCGCGGTCTCCACCAACTCCGGCAGGTATTTCTCCGGCCGGACGATCTGCG

<DLFASAIARCRAFHATEVLE P LYKE PR VI QP 
10008 GGAGACGGGCAGGCGCGGCGTCGTCGCCACGGCCGGCGATGTCCCGGTAGTCGCCGGTGTCGGGCGACGCGTGATCGGCGAAGAGCCGGTCG 

< LRAPAADDGRGAIDRYDGTDPSAHDAFLRD 
10100 TAGGCGGCCCGGAGCCAGGCGACCTCGGCGTCGTCGGCGAGCTGCGGGAGAGTCACGAAGCCATCGCGCCGGTAAGCCTCCAGCCGACGGTC

<Y AARLWAVEADDALQPLTVFGDRRYAELRRD 
10192 GACGACCTCCGCACCAACAGTCCCCACGGCCATTTGACCACCTCTCGGAATAGCCTGTCCGCGAATAAACCATACGGTAGGAACAGCGCG 
<VVEAGVTG VAM 

10282 GCGATACCGCTCCCGAGCGGGAAATAGGGATTCGACTAGTATTCGGTCCGCGCCGCTGCCAGAACGGCACGCGCTCTCGATTGTCCATTCAT

10374 CCCCGTGCGAGACTCGCCTCGATGTCCTCGATGTCGGTGGGGGGTTTGGGATGACCGGGCACAGCGCCGTCGCGCTGGACGTCGGCGGGGT

>MTG H S AVALDVGGV 

10465 CGTCTACTACGACGAGCCGTTCGAGCTGGCCTGGCTCCAGGACACCTTCGACCGCCTCCAGGCCACCGACCCGACGCTCGACCTGCGTGCGT

>VYYDEPFELAWLQDTFDRLQATDPTLDLRA

10557 TTCTGGAGCACGTCGAGCGGTTCTACCACTACGGCGAGGGCGACCCAACCGGCCGGACCTGGCTCCACTCGGAGGCCGCCGCGCTGAGCTGG

>FLEHVERFYHYGEGDPTGRTWLHSEAAALSW
10649 TCGCGGGTCCGGCAGTCCTGGGGCGAGCTGGCCCAGGAGATTCCCGGTGCCGTTCGCGCGGTCACCAGGCTGGCCAGGGAACTACCCGTCGT 

>S RVRQSWG ELAQE I PGAVRAVTRLARELPVV 
10741 GATCGTCGCCAACCAGCCCCCCGAGTGCGCGGACGTACTGGCCCGGTGGCAGGTCAGCCAGGTCTGCCGGGAGGTGCTCCTCGACTCCCTCG 

> I VANQPPE CADVLARWQVSQVCREVLLDSL 
10833 TCGGGGTGGCCAAGCCCGACCCGGCCCTGCTCGGGCTCGCCCTGCGGCGGCTGGCGATCCCGCCCGCCGAGTTGCTGGTGGTGGGCAACCGG

>V GVAKPDPALLGLALRRLAI PPAELLVVGNR 
10925 ACGGATCACGACGTCCTGCCCGCGCTCGGGCTCGGTTGCCCGGTGGCGTTCGTGCTGCCCGATCCGGCGTACCGCCGGCCGCCGGGCGTCCA

>TDHDVLPALGLGCPVAFVLPDPAYRRPPGVH 
11017 TCCGGACCTGGTCCGGGTCTACACGGAGCTGAGGGCGTTCCGCACCGGCTCCCCGCCCGCGGACGCCCGGGTCACCACCGTGGCGTCCCTGG 

>PDLVRVYTELRAFRTGSPPADARVTTVASL 
11109 CGGCCCTGGCCGACTCTCCCCTGACGAGTGCCACCCCGCGTTCGAACGCCGGCACCGGCGGACTTTGACGAAGGAGTGCAGTTGCGACGCC

>A ALADSPLTSATPRSNAGTGGL* 
11200 CCGCAGCGGTGGTCGGCGCCACCGGCTTCATCGGCTCACGCCTCGTCTCCCGCCTGGCCGAGGCCGGGCATCCGGTGGCGCGCTTCAGCCG

11291 TGCCGCCCCACCCGTCGTCGACGGCCGGCCCGCGCCGGGGCTCCGCGAGGCGCAGGTCGTCTACTTCCTCGCCGCCCGGCTGAGCCCGGCGC

11383 TGGCGGAGCAGCAACCGGAACGGGTCGTCCGGGAACGCGAGTTGTTGCTGGACGTGCTAAGTGCGCTGGCGGGGGTGGACCACCGGCCGGTG

11475 TTCGTCCTGGCCAGCTCGGGCGGGGCGGTGTACACGCCGACGGTGTGGCCGCCCTACCACGAGCGGTCGGCCACCGGGCCCGCCTCGGCGTA
11567 CGGCCGGGCGAAGCTGCGGCTGGAACAGGAGCTGCTGCGCCACACCGACCGGGTGCAGCCGGTGGTGACCCGGCTGAGCAACGTCTACGGTC
11659 CGGGGCAGCGGCCGACCCCCGGGTACGGTGTCCTGTCACACTGGCTGGAGGCCACCGTGCGCGGAGAGCCGATCCGGCTCTTCGGCGATCCG
11751 GCCGTGGTGCGGGACTACGTACACGTCGACGACGTCACCGCGATCATGGAGGTCATCGCGCAGCGGGCCGGTGACGGCGACCGGGACCGGCT 
11843 GCCCACGGTCGTGAACGTCGGCTCGGGCCTGCCCACCTCCCTCGCCGAGTTGCTCCAGACGATGTCCACGGTGGCCGGTCGTGAGCTGGAGG
11935 TCATCCGGGACGTCCGCCGGCAGTTCGACCATCGGGGCAACTGGCTCGACACCACCCTCGCCCGGGAGACCCTGGGCTGGCAGGCGCGGATC
12027 AGCCTCCCCGACGGCGTCCGCCAGTGCTGGGAGGCCGTCCTCACCCGGGCGGGCGGCCCGGGGGGTTCCCCGGCCCGACCGTCAGCCCGGC

> S L PDGVRQCWEAVLTRAGG PGGS PARPSAR 

< • G P 

12118 TCGGGAGAGCGTCTCGGGGGCGGGAACCGCCGCAACCGCGCCCTTCGCAGCAGTTCGTGGCTCAACCCGGCGGCGGTCGCCGCGGTGTAGCC 
>LGRASRGREPPQPRPSQQFVAQPGGGRRGVA 
<E PSRRPP P FRRLRARR LL EH S LGA ATAAT YG 

junction marker 

12210 GAGGGCCAGTGGCAGGGCGAGCCGGGATTCCGGGGCCGTACGGTGCCGGTCCCATTCCTTGCGCAGGCCGGCCCGTGCCTGGTTCCGTCCGG 
>EGQWQGEPGFRGRTVPVPFLAQAGP.CLVPSG 
<LALPLALRS EPATRHRDWE KRLGARAQNRGA 

12302 CCTCGCACCGCCCCTGCCAGTACGCCCGCCGCAGCAGGTACCGGGGGGTCAGCCGGCCCGGGTCGATGTCATGGGTGACC

>LAPPLPVRPPQQVPGGQPARVDVMGD 
< ECRGQWYARRLLYRPTLRG PD I DHTV 
12382 GCGTGGTCCGGGAGCAGTTGCTCGCGGGCGCCGGCGGCCTTCATGGCGCTGATGAAGGAGGTGTCCTCC 
>R VVREQLLAGAGGLHGADEGGVL 
<A HDPIiLQERAGAAKMAS I FSTDE 

12451 CCTGACTGGAGGTTGCCCCCGGTACGGCTGAGGGCCAGGTCGAAATCCAACCCGTGGGCGTGCGCGAAC

>P • 

<G S QLNGG T R S LA LD FD.L G HAHAF 
12520 GCGGAGTCCACCCCCATGCACGCGCCCCAGATCTTGATGTTTCCCTGGTCACGGTGCCAGCCGACCAGGTGGAACTGGCCGGA 
<ASDVGMCA r GWIKINGQDRHWGVLHFQGS 

junction marker 

12603 GGTGACGTACCACGGCAGACGCAGCGCGGGGCGGGCCAGCCGGGTGCCGACCACGTGCGCGCCGTCGCGCAGGCTCTTGCGGACGGCGTCGA

<TVYWPLRLAPRALRTGVVHAGDRLSKRVADV 
12695 CGGCAGCGGCGTCGAGCCGCACGTCGTCGTCGACGAACATCAGATGGTGGTGCGGCCAGCGGGCGAGCATCGCGTTGCGGGAGGCCGACAGG 

<AAADLRVDDDVFMLHHHPWRALMANRSASL 
12787 CCATTGGTGGCACCGAGGATGCGCATGGTGCCGCCGGCGGCCCGGACCTCCTCGGCGACCTCCTCCGCCTCGGCCGTGACGGGCCGGTCCAG

<G NTAGLI RMTGGAARVEEAVEEAEATVPRDL 
12879 CAGGACGTAGTACTCGTCGCCGGAGAGCTGGGCCATGTTGTGCGCGAGGTGTTTCCTGACGTTCTCCACCCGGAACGCGCAGATCGCCACCA

<LVYYEDG S LQAMNHALHKRVNEVRFAC IAVV 

12971 CCATCGGGTGGTCGGACGGATCGCGGCTGACCACAGAGGCGTTGTTCGGCATCGTCCCTCGACATGAGCGTTGTGGGGCCGGCGGGGAGA

<MPHDSPDRSVVSANNPM 
13061 GGCGGCCCACCGGATCGCGCTCCGCGGCAGCGTGCCGCGGCCCCGGCCGCCCCACGGACAGGTGGCGCACCGCAGCGCGGCGGGCCGTCCGG 
13153 TGAGCGGCCGGGGGCGGAGCTCAGGCCGCTTCGTCCTTGCTGACGGTCGGCGGCCAGGTGCCCGCCGCGAGGATGCCCATCGCGTACGCCTT 
13245 CGCGATCAGGGCCGGCCGGTTGGGCACTCGCAGGTTCTGCAACAGCTTGCTGACGTGGTACTCGACGCCCTGACGGCTGAGGTAGACCTTGT
13337 TGGCGATATGCACCGTCCGTTCGCCTGCGGCGATGCTTTCGATGATGCGCGCGTCCAGATCGGACAGTGAGAACTTGAGATTCACAACGCCC
13429 CCTTGATTGTGGATCTGTTGGCTCGTACGCGGCCAGACGTCATCGCCCGGACACCCCCCTGAGGTGCCGGTGGACGAGTGGTCGCTTCCGCT

13521 GGAAGACCGTCAGCGGCGGAAGGCCGGGTGCCCGCCACTCGTCGTCCTGCGCTTGCGGAGCTGTCTCAGTGTGCGGCGATGCCCCCGGCACG
13613 TAGCCCGCCTGACGGGCCTGTCGGCTTGGGCTCCAACTTCGGCAACCCATCGGTTCGCGCTGCACAGAACCAGCGGAGGAACATTGAGTTTC 
13705 GAAGCCCTTCGCTACTGTCGATCGACATCACTGGCGAAATGAAACCGACCGGATTCGAGCCGATAGTTACGGAAAGTGACAATCGGCTGGCC
13797 CTGTCGCTCGCGCTGAACTCACCAATACGCCAAAAGCGTAGCCGGCCCACTGCGGAGCGTCCACCCCCCGAGGATATCGCCAGGCTTCCATG
13889 CAGAACTGGCAGGATCTTTCATCTCAGCCGCACCTGGCGACAAAACCCCTGCTCAAGACCATGAGTAAGCAGGCGCGGGAAATCCATGCAGT

13981 GACATGTGTCACTTTAGACAACCCAGCTCCAGCCAGGCCCACCACCCTGACAAAAGGGGCGGAATCGCGACCAGAGCGACACCAGCACATTC

14073 CTAGGGGATTCCTTAGTCTGGCGAGCGTCGATCCCCCACCCGTCAATCAATTGCCCGAACAAATTGCCCACCCGTCAAGATCAACCACTCCG
14165 CACCGTGAGCAGGCCGACCGTCCGGCAATGCGAACGACGACGTCCACCCCGTCCCGGTAGCCTCACCGCCGCTGGGGGGGCACGGGCCCGGA 
14257 CACCGTCCGGCGTCGTGATCAGCGGGTCGCTCGCCGGCTGTCGCCCCGCGCGGGGACCGGCATCGCGCGCGGGGCCGGCACGAAGATCTCCG
14349 GCGACCTGTCAGCCCTGCAAGCGGACCGGCATCGCGCGCGGGGCCGGAACCACCCGACGGTTCAGCGCCGGGACCGATCGGGAAGCCGACT

<• RRSRDPLRS 
14440 CGCGGCCTCGTCCCAGAGCCCGCCCACCGCATCCCGCAGCGATCGTCGGGGCCGCCACCCGAGCAGGTCACGCGCGGGCCCCGGGTCGACGC

<AAEDWLGGVADRLSRR PRWGLLDRAPG PDVR 
14532 GGGCCCAGTCGACGACCTCCACCGAGCCGGGCCGATCCGGCAACTCCACCACCTCGGTCGGCACCCCGCTCACCTCGATCAACATCTCCACG

<AWDVVEVSGPRDPLEVVETPVGSVEILMEV 
14624 AGGGAGCGGACCGGAACGGCCTCCCCCCGCCCCACGCCGATCACCCGCCCGGTGACGGACTCCCGCGTGGCCGACACCACCGCCTCGGCGAC

<L SRVPVAEGRGVGIVRGTVSERTASVVAEAV 
14716 GTCCCGCACGTCGACGTAGTCCCGGTGGGCGCGCAACGGAGACAACTCCACCCTGGCCGACCGGTCGCGGCCCGCCGCGTCGACCAGGCGAA

<DRVDVYDRHARLPSLEVR ASRDRGAADVLRV 
14808 CGACGACCCGCCCCAACAGGCTGTCGGGTGGCACGCCCGGACCCACCACGTTCGCGAGCCGCAGCACCGTCGCGTCCACCGAGCCGGCCCGC

<VVRGLLSDPPVGPGVVNALRLVTADVSGAR 
14900 GTCGCCGCGAGCACGGCCTGGGTGGCGGCCAGCTTCGCCCTGCCGTACATGCTCTCCGGCTGGGTGGGCACCGTGGCACCGGCGGGAGCCCC

<T AALVAQTAALKARGYMSE PQTPVTAGAPAG 
14992 GGGTGGCTCCTGGACGCGTTCCAGGACCGAGCCGAGGTGCACCAGCCTGGGCCGGCAACGGGTGCGCTCCAGCGCCGCGGTGACACTCCACG

<PPEQVRELVSGLHVLRPRCRTRELAATVSWT 
15084 TCGGGATCGTGCAGCTCGACGGCAGCTCGGCGTCCGTCAGGTTCCACTTGCCGCCGGTGGCGTTGACCACCGCGTCGGGCTGCTCGGCGTCG

< PITCSSPLEADTLNWKGGTANVVADPQEAD 
15176 AACACGGCGGCCAGCGCGGCGGGCTCCAGGGTGGCGACGTCCAGCGCCCGGGCCCGGTACGGCAGCCCCGCCGACGGGACGCGGCGGGCCAA 

<F VAALAAPELTAVDLARARYPLGASPVRRAL. 
15268 CACGAGGACGTCGTCGCCCCGGGCGGCAAGCGCGGCGCTCACGTGACGCCCGACGAAGCCCGTACCACCGACGACCACGACACGCCGCGCGC

<VLVDDGRAALAASVHRGVFGTGGVVVVRRAG 
15360 CCATCCGTACCTCCTGGGGATCAGTCTCGTGCGCCGGCGGCGTCCAGGCGACCGCCCGGCCCCTGACAGGTCACGGGGGCGCGCAACAC

<M <o DRAGAADLRGG PGQCTVPARLV 

15449 CCTGGCCCGGTCGCGAACCTCGTCCAGCAGCCGGGCCCGGGCGCGGATCGCCGTCACCTCCTCCGCCGGCTGACCGGCGGTGACGGCGCGGA

<RARDRVEDLLRARARIATVEEAPQGATVARV 

15541 CGAACTCTCGCATCGTGTTGACGAACTGGTCCTCGGCCGGGAAGGTCAGCTCCCGCGTCTCGTCCTGCCGCTCCACCCGCACCACCGGGTGC

< FERMTNVFQDEAPFTLERTEDQREVRVVPH 
15633 CAGGCCGGCGGTGGGGTGTACGCCCGGTCGACGACGATCCGGCCGGCGCTTCCCCAGAGCTGGTACTCGCACCGGTAGGAGTGCTCGAAACC

<W APPPTYARDVVIRGASGWLQYECRYSHEFG 
15725 GAAGGCGATCTGTGCGGTCCGCCCGTCCGGCGTACACAACAGCGCGGCACCCGAAACGTCCACGCCGCGATCGGGGTCCTCCCGGAGGGTGG

<FAI QATRG D PTCLLAAGS VDVGRDPDE RLTA 
15817 CCGCCACCACCTCCGGCTCCTCCGGCAGGAAGAACCGGGCCGCGCCCAACGGGTAGACGCCCAGATCCAGCAGCGCTCCGCCACCCAGCTCG 

<AVVE PEE PLFFRAAGLPYVGLDLLAGGGLE 
15909 GGTCGGTAACGGATGTCGCCCGCAGGCAGCGGCGGAAACCCGAAGACGCCGGAGACCATCCGCAGCTCGCCGATCTCGCCGGCCGCGACCAT

<P RYRI DGAPLPPFGFVGSVMRLEGIEGAAVM 

16001 CCGGCGCACGAAGTGGTGCACCCCATGTCGGAGGAAGGTGAGGTTGTCCATCAACACGAGCCCACGCGACCGGGCCTGGGTCAGCACCGCCG

<RRVFHHVGHRLFTLNDMLVLGRSRAQTLVAA 
16093 CGGTGTCGACCAGCCGGGTGGTCAGCGGCTTCTCCACCAGCACGTGTTTGCCCGCGGCCAGGGCGCGTTCGATCCAGGTGTGGTGCAGCCCG 

<TDVLRTTIiPKEVLVHKGAALAREIWTHHLG 
16185 GTCGGCAGCGGAATGTAGACGGCATCGATGTCCGGGCGGTCGAGAACGGACTGGTAGCCCTCGGCGGCGGCGCACCCGAACTCCGCGGCGAA

<T PLP I YVAD I D PRDLVSQYGEAAAC .GFEAAF 

16277 GGCGCGCGCCTTGGCCAGTTCCCGCGCCGCGACCACCACGAGCTCCGCCTCCGGGACCCGCCTGATCGCCGGCAGGGCACGGCGGCGCGCGA

<ARAKALERAAVVVLEAEPVRRIAPLARRRAI 

16369 TGTCGGCGCAGCCGAGAACCCCGATGCGGACCGTCATCTCCGCCATCGGGTCACCACAGGCTGCGCAGGCAGGCCAGCAAGCTGCGGGCC

<DACGLVG I RVTMEAM <• W LSRLCALLSRA 

16459 TCGATGTTGAGGTAGTAGCCGTGCCGGAGCAGGGCCTCCAGCTGGCGCACCGTCACCCAGCAGAACTCGTCCGGCACCTCGGTCGGGAAGTC 

<EINLYYGHRLLAELQRVTVWCFEDPVETPFD
16551 GTCGCCGGCGTCGACCAGGAGGTAACGGTTCTCCGACCGGTAGAATCGCCCGCCCTCCTCGGTGAGCACGGTGTCGTAGAGGACACGCTCGG

<DGADVLLYRNESRYFRGGEETLVTDYLVREP 
16643 GGGCGGCTTCCAGCACCTCGGCCAGGAACAGTGGTCGGGGGCCGGGCTGGTTGTCCGGGATGCACTGCACCGTTGGGCCCATCTCCATCGCG

<AAELVEALFLPRPGP.QNDPICQVTPGMEMA 
16735 TCGAGCAGCCCCGCCTGGTAGCGCGCGTGCACCAGCAGGTGCGCCACTCCGTCGATTTCCTTGACCAGGAAGGCGACCACGCCCCGGTGCCG

<D LLGAQYR.AHVLLHAVGD I E KVL FAVVGRHR 
16827 CGGATACAACAGCGGCTGACTCCACCCGGTCACCTCGCGGTTGTCGATGCGCACCGTGACGCCGACCACGGAGAAGTGCCGGCCGTCGTCCC

<PYLLPQSWGTVERNDIRVTVGVVSFHRGDDR 
16919 GCGCGATCCCGTCCGCGGTGTGCCGCCAGTCCGGCAGCCCTCGTAGCGGCACCCGCCGCACGGTCATCTCGTGCCGGCCCTTGGCCCCGGTG 

<AIGDATHRWDPLGRLPVRRVTMEHRGKAGT 
17011 AACCAGCTCAACACCGACACCAGGTCGTGGCGGCCGGGGGCGCCCGCCGCCGAGCGCACGATCGCCGCCACCGCCGGCGACGACGGCGCCTC 

<F WSLVSVLDHRGPAGAASRVIAAVAPSSPAE 
17103 GGTCTCCTGGGCCGCGGCGTAGAAGGCCGAGGGCAGACATGACAGCACCGTACGCGTGTCCATGTTGACCAGGCCGTCCACCCGCAGCAGCG

<TEQAAAY FAS PLCSLVTRT DMNVLGDVRLLA 
17195 CGAGCAGTTCGCGCAGCGGGAGCCACCGGTGGTAGTCGCCGGCCGGCACGTCCTCGTCGACCTGGACCACCATGTTCCGGTTGCGCTTGCGC

< LLERLPLWRHYDGAPVDEDVQVVMNRNRKR 
17287 AGGAACCAGGAGCCCTGCTCCGACTGCAGGACGTCAACCAGCACCCGACCGGCCCCCGGGCGGGTGAAGTAGTCGAGGTACTTCGTGCCGCC

<L FWSGQESQLVDVLVRGAG PRTFYDLYKTGG 
17379 GCCACGGTGCACCCGGGTGTAGTTGCTCCGGGTGGCCTGCACCGTCGGCGAGAGCTGCATGACGTTGATGTTGCCGGGCTCCACCTTGGCCT 

<GRHVRTYNSRTAQVTPSLQMVNINGPEVKAQ 
17471 GGAGCAGGCAGTACGGTGTCCCGTCGACGACCTTGACGAGCATGCCGAGGATGCCGATCTCCGGCTGGTTGATGATCGGCTGGTGCCATTCG

< LLCY PTG DVVKVLMGLIG I E P- QN I I PQHWE 
17563 CGCACCGCGCCGTAGGTGGTCTGGACGTGCAGCCCCTCGATCACGAAGAACCGGCCGCTCTCGTGCCCGAGGTTGCCGGTCACCGGGTCGAA 

<R VAGYTTQVHLGEIVFFRGSEH GLNGTVPDF 
17655 CGCCCACCCGGGCAGCCGGTCCAGCGGCACGCGGTCCACCCGGCAGTAGGTCGACCGGGTCCGCTCGGCGAACCAGGAGAGGAAGTCCGGCC

<AWG P LRD h PVRDVRCYTS RT R EAFWS LFD P R 
17747 GGACCCCCTCGGCGTGCAACGCCGACCACGAGCCGCCGTCGACCGGGCCGGGCCGGTGCGGGCCCGCGTCCCGGGTCAGCAACGGCCCGTCG

< VGEAHLASWSGGDVPGPRH PGADRTLLPGD 
17839 GCGCGGACCTTCGGGTCGGGCGACGAATCGCTCACCACAGCTCGGCCAGTTCGGCGGAAAAGGCGTGGGAAGACCCGTCCGTGTCCGCCAA

<ARVKPDPSSDSV 

<• WLEALEA S FAHS SGDTDAL 
17930 CAGGCGCAGATCGCTGTCGACCATCATGGCGACCATCTCCTCGAAGGAGACGGAGGGTTTCCAGCCGAGCCGCTGGCGGGCCTTCGTCGGAT 

<LRLDSDVMMAVMEEFSVS P KWGLRQRAKT PD 
18022 CCGCGCAGAGCAGCTCGACCTCGGCGGGCCGGATGAGCGACTCGTCCACCACCACGTGGTCCCGCCAGTTGAGGCCCACGTGGGCGAAGGCC 

<ACLLEVEAPRILSEDVVVHDRWNLGVHAFA 
18114 GCCTCGACCAGCTCGCGGACGCTGTGCGTGACCCCCGTGCCGAGGACGTAGTCCTCCGGCTCGTCCTGGGCCAGCATCAGGACCATGCCCCG 

<A EVLERVSHTVGTGLVYDE PEDQALMLVMGR 
18206 CACGTAGTCGCCCGCGAAGCCCCAGTCCCGCTCGGCCGAGAGGTTGCCCAGGCGAAGCGAGCTGCGAATGCCCAGCTTCACCGCCGCCACGC

<VYDGAFGWDREASLNGLRL S S RI GLKVAAVG 
18298 CCAGCGACACCTTGCGGGTGACGAACTCGGGACCACGCACCGGTGATTCGTGGTTGAACAGAATGCCGGAGACGGCATACATGCCGTACGAC 

< LSVKRTVFEPGRVPSEHNFLIGSVAYMGYS 

18390 TCACGGTAGTTCTGCACCATGTAATGCCCGAACGCCTTGGCCGCGCCGTACGGGCTACGCGGATGGAACGGGGTCAGCTCATTCTGGACGGG

<E RYNQVMYHG FAKAAGYPS RPHF PTLENQVP 

18482 CTCCCGCACCTTGCCGAACATCTCCGACGAAGACGCCTGATAAAAGCGCGGCTGACCGGCTGCGGGACTGCGGGAATCCGACAGGCCCCCCA

<ERVKGFMESSSAQYFRPQGAAPSRSDSLGGV 
18574 CGATCCGCAAGGCTTCGAGCATGCGGAGCACACCCATGCCGGTGACCTCCGCCGTCGTGGTGGACTGCCGCCACGACACCGGCACGTACGAC

< IRLAELMRLVGMGTVEATTTSQRWSVPVYS 
18666 AGCGCGCCGAGGTTGTAGACCTCGTCCGGCGCCGCGCGTTCGATCGCCGCCACCAGGCTCGTCTGATCCAGAAGGTCGCCGCTGATCAGCTT

<L AGLNYVEDPAAREIAAVLSTQDLLDGSI LK 
18758 GACCGCTGGATCAGGTTGCCGAAGGCTGCGGACCGAGGGCGCGGTCTGCCCGCGCACCAATCCAAATACCTCGTATCCGGACTGAAGCAGGT
<VAPDPQRLSRVSPATQGRVLGFVEYGSQLLH 
18850 GCTCCGCGAGATACGTGCCGTCCTGGCCGGTAATTCCAGTGATCAGCGCCCGCCGTGTCAGGGTAGTCTCCAGCCGTGAAGCCACCTGGCC

< EALYTGDQGTIGTILARRTLTTELRSAV 
18941 GAGGCGTGACCTCGCGCCGATGGCGGACCAAAGATCCGCCCGTTCGAAATGGGGTCGGATCTCCCGCTACCGCGCTCACGGTACGCGAATCT
19033 CCAAGCGGATTCAGCGACCCGGAAAGCAATATAGGGAGGTTACTAGTAGTACTTTCCGGCGGGCCGGCAGCGACGCCCGGCCCGGACCGGCA
19125 GGATCGCCCCGTTCGGCCGGCGGACCCATCCCGGACACCTTCCACCGAAGCTCTTCGGGATCGTCGCCCGCCGATGCGAACTGCTTGACTCC
19217 ACCGTTTTGTCCCCCTAACGTCGGCGAGGCTGCCAGCCGGCCCGCGAGCCGGGCCGGGTTCACGAAGGCGCCAACTTCCGGTGAGAGAGCAG 

19309 GGCTCATGTCGCAGAGCCGGCCCGCGGCCGCAGCACCCCCAGCCCGCCTGCCACGTCGCCGCGCCATGGTCGCGTTGGTCGCGGTGATGAT

>MVALVAVMI 
19400 CCCGATGGTGCTGGCCACCCTCGACAACACCATCATCGGCACCGCACTGCCCACCGTGGTCGGCGAGTTGGGCGGCCTCAGCACGCTCTCCT 

> PMVLATLDNTI IGTALPTVVGELGGLSTLS 

19492 GGGTGATCACCTCGTACACGCTGGCCACGGCCGCCTCCACGCCGGTCTGGGGCAAGCTCGCCGACATGTACGGCGGCAAGGTGGTCTTCGTG

>WVITSYTLATAASTPVWGKLADMYGGKVVFV 

195 84 GCCACGCTGGTCGTGTTCCTGGCCGGGTCGCTGCTGTCCGGCATGGCGCAGAGCATCACCCAGCTGACCGTCTTCCGGGCCGTGCACGGGCT 

>ATLVVFLAGS LLSGMAQS I TQLTVFRAVHGL 

196 76 CGGCGCGGGCGGCCTGATGGTCTGCGCGTTCGCCATCATGGTGGAGGTTCTCGCCGGCCCTGACCTGCCCAAGTACCAGGGCATCATGTCGG 

>GAGGLMVCAFAIMVEVLAGPDLPKYQG IMS 
1976 8 CGACCATGGGCCTGACCATGGTGGCGGGCCCGCTCGTCGGCGGCCTGATCACCGATGAGCTCGGCTGGCGCTGGTGCTTCTACATCAACCTG 

>A TMGLTMVAGPLVGGLITDELGWRWCFYINL 
19860 CCGATCGGGGCGGTCGCGCTGCTCATCGTGGTGCTGATGATGCACCTGCCGCGCCGACACACCAAGGCCCGGATCGATTACGCGGGTGCTGC 

>P IGAVALL I VVLMMHL PRRHTKARI DYAGAA 
199 52 CCTGCTCACCGTGGTCAGTTCGTGCGTCGTGCTGGTGACCACCTGGGGCGGCATCACCTACCCCTGGGCGTCTCCGATGATCCTGGGGCTGG 

>LLTVVSSCVVLVTTWGGITYPWASPMILGL 
2 004 4 TCGCGCTCGGGGTGCTGACCTGCGCGCTCTTCGTGGTGGTCGAGCGACGGGTGGCCGAGCCGTTGGTGCCCCTGGCCATGTTCCGCAGCCTG 

>V ALGVLTCALFVVVERRVAE PLVPLAMFRSL 
2 013 6 AACTTCACCCTGAGCACCCTCATCGCCTTCCTGGTCGGCTTCGCCCTCATCGCGGGGCTGACCTTCCTGGCCCTGTTCCAGCAGGCGGTGCA 

>NFTLSTLIAFLVGFALIAGLTFLALFQQAVQ 
2 02 2 8 GGGTGCCTCCGCGTCCGACTCCGGCCTGTTGCTGCTGCCCCTGCTGCTGTCCATGGCGGCGGTCAACGTGGTCGGGGGTCGCCTGATGAGCG 

>GASASDSG LLLLPLLLSMAAVNVVGGRLMS 
2 0320 GCGGGCGTTCCTACCGGCTGCTGATGCTCGCCGGTGCGGCGCTGATGACCCTGAGCCTGCTGCTCTTCGCCCTGATGGACGTGGGCACCAGC 

>G GRSYRLLMLAGAALMTLS LLLFALMDVGTS 
2 0412 CGGACGGTCACCGCGATCCCCATGGTCGGCTTCGGCGCAGGGCTGGGGCTGCTCATGCAGACCAGCCTGATGGTGGCGCTGAGCAGCGTGGA 

>RTVTAIPMVGFGAGLGLLMQTSLMVALSSVE 
20504 GATGAGGAACCTCGGGGTGGCCGCCTCCACGTCCACGCTCTTCCGCACCATCGGTGGGGCGGTGGGGGCGTCGGCGACGGTCTCGCTGTTCT 

>MRNLGVAASTSTLFRTIGGAVGASATVSLF
2 05 96 CCGTGCGGGTGCAGTCGGCGCTGGCCGATCGGGGGGTCGCCGACGTGGCTGACCTCCTCGGCCACTCCGCGCGGCTGGACGCCGCCGGGCTG 

>SVRVQSALADRGVADVADLLGHSARLDAAGL 
20688 GCCCAACTCCCCCGGGCCGTCCGTGTCCACTTCATGCACGCGGTGGCCTCCGGCACCCGGTGGGCCTTCCTGATGACCGTGCTGGCGGGGCT 

>AQLPRAVRVHFMHAVASGTRWAFLMTVLAGL 
20780 GATCTGCGTCGCGGCGGCGTGGTTCCTGCGCCGGGTCACCCCGTTGACGTCGGCACCGGTGGCACCCGAACCGGCGCGCGACGTCGCCGCGC 

> ICVAAAWFLRRVTPLTSAPVAPEPARDVAA 
2 0872 CCGCCGCCAGCAGCGGGCGCGCGCCGAACTACTAGCGGATTTCCTAGGGTTCCTCGTCGACGGTAGAGCTGAATTCACCGGCGACCTAACA 

>P AASSGRAPNY* 
2 0963 TTCTTTTCGCGATCCGGAATCCGTCCATTCCCCTGTCTGGCGATGGTCGACGGGCCGGCCCGTGCCGAGCGGACAGACACAGATTCTCGGAT 
2105 5 TGGAGCTCGATGTCCAGCAAGATCCTAGTCATCGGTGGAGGTCCGGCCGGATCCACGGCCGCCGCGCTGCTCGCCCGATCGGGGCTGTCG 

>MSSKI LVIGGGPAGSTAAALLARSGLS 
21145 GTGACGCTCCTGGAAAAGGAGACGTTCCCGCGATACCACATCGGCGAGTCGATCGCGTCCTCGTGCCGCACCATCGTCGATTTCGTGGGCGC

>VTLLEKETF PRYHIGES IASSCRTIVDFVGA 
2123 7 TCTCGACGAGGTCGACTCGCGGGGCTACCCGCAGAAGAACGGGGTCCTGCTGCGCTGGGGCAACGAGGACTGGGCCATCGACTGGGCCAAGA 

> LDEVDSRGYPQKNGVLLRWGNEDWAIDWAK 

213 2 9 TCTTCGGTCCGGGCGTGCGGTCCTGGCAGGTCGACCGGGACGACTTCGACCACGTCCTGCTCAACAACGCCGGCAAGCAGGGCGCCAAGATC 

>I FG PGVR SWQVDRDDFDHV L LN NAGKQGA KI 

214 21 ATCCAGGGCGCGGCTGTCAAGCGGGTGTTGTTCGACGGTGAGCGGGCCACCGCCGCCGAGTGGTTCGACCCCGAGTCGGGTGAGGTCCGCAC 

>IQGAAVKRVLFDGERATAAEWFDPESGEVRT 
21513 CATCGATTTCGACTACGTGGTCGACGCGTCCGGCCGGGCCGGGCTGATCCCGTCCCAGCACTTCAAGCACCGGCGCCCCACCGAGACGTTCA 

> IDFDYVVDASGRAGLI PSQHFKHRRPTETF 
21605 AGAACGTGGCCATCTGGGGCTACTGGCAGGGTGGCTCGCTGCTGCCGAACTCTCCCTCCGGCGGGATCAACGTCATCTCCGCGCCCGACGGC

>K NVAIWGYWQGGSLLPNS PSGGINVISAPDG 
216 97 TGGTACTGGGTCATTCCGCTGCGCGGCGACCGGTACAGCATCGGCTTCGTCTGCCACCAGAGCCGCTTCCTGGAGCGGCGCAAGGAGCACGC 

>WYWVI PLRGDRYS IGFVCHQSRFLERRKEHA 
2178 9 CTCGCTGGAGGACATGCTCGCCGCACTGGTACAGGAGTCCCCGACCGTGCGCGGCCTGACGGCGAACGGGACGTACCAGCCGGGCGTGCGGG 

> S LEDMLAALVQESPTVRGLTANGTYQPGVR 
21881 TGGAGCAGGACTTCTCGTACATCTCCGACAGCTTCTGCGGGCCCGGCTACTTCGCGGCCGGCGACTCCGCCTGCTTCCTGGACCCACTGCTG

>V EQDFSYISDSFCGPGYFAAGDSACFLDPLL 
21973 TCCACCGGCGTGCACCTCGCCCTCTACAGCGGCATGCTCGCCTCGGCGTCCATCCTGGCCACCATCCACGGTGACGTCACCGAGGAGGAGGC 

>S TGVH LALY SGMLASAS I LAT I HGDVT EEEA 
22 065 GCGdGCGTTCTACGAGTCCCTCTACCGCAACGCCTACCAGCGCCTGTTCACCCTCGTCGCCGGCGTCTACCAGCAGCAGGCCGGCAAGAGGG 

> RAFYESLYRNAYQRLFTLVAGVYQQQAGKR 
22157 CATACTTCGGCCTGGCCGACGCGCTGGTGCACGACAGCGGCGAACCCGAGTACGAGAAGGTAGACGGGGCCCGCGCCTTCGCCCAGCTCGTC

>A YFGLADALVHDSGEPEYE KVDGARAFAQLV 
2224 9 GCCGGCCTCGCCGACCTGGACGACGCGGCGGAGGGACGGCACGACAGCACCGCGGCGGCGGCACCGGCGGAGCAGGACAACTCCGTCCGGCA 

>AGLADLDDAAEGRHDSTAAAAPAEQDNSVRQ 
22341 GCTCTTCCTGGCCGCCGAGGAGGCCCGCCGGATGGCCGACGCGCGCACGCCGAGCGCCCCGGTCAGCGAGGCGCCGGGCAAGCTCGACAGCC 

> LFLAAEEARRMADARTPSAPVSEAPGKLDS 
22433 ACGACCTCTTCGACTCGGCAACCGGCCTCTACCTGGTCACCACCCCGCGACTGGGGATCCGCCGGGCCAAGCCGGCCGACACGCAGGCGGCG

>H DLFDSATGLYLVTTPRLG I RRAKPADTQAA 
22525 GCAGAGCAGTCTGCCTGAGGTTCCACCCCTGGTGGCCCCCGGCCCGGACCGCGCCGGTCCGGGGGCTGCTCAACCCTCCCACCAACATCCG

> AEQSA*

22616 GCATCCGGTGCCGGCGGCTGAGCAGGGGCAGCGCCACCGACTCCGGCCCGTCACATGGACAAGGTCACCTCTCCCGTGCTGAACAGACGACA 
22 7 0 8 GTTGCTCGCCCTCGGCAGCGTGGCCGCCGGAGGCACCGTCATGTCACGCTCCCTCAGACGGGACGCGCAGGCCGCCCAGGCGGCACCGGC 

>MSRS L R RDAQAAQAA PA 
2 2 7 9 8 GTCGCCCGCCAACCCGCACGCCGGGCACGCCGCCCCGGTGCCCAGCCGGGTCAGCACGACCACGGTCGCGGTCACCCCGTTCACCGAGCCGA 

> S PANPHAGHAAPVPSRVS TTTVAVTPFTE P 

22890 TGCCCGTCCCGCCGCGGCTGACCCCGGTCTCCCGCCGCGACGGCATCGACGTCTACGAGATCCCCATCCGGCCGGCGCAGGTGCAGATCCTG
>MPVPPRLTPVSRRDGIDVYEIPIRPAQVQIL 

22 982 CCCGGCCTGCTCACGCCCGCCTACACCTACGCCGGTTCCTTCGTCGGCCCCACCATCCGGGCCCGCACGGGCCGGCCCGTGCGGATCACCTA 

>PGLLTPAYTYAGSFVGPT I RARTGRPVRI TY 

23 074 CACCAACGGGCTCGACACCCACGCCAACGTGCACCTGCACGGCGGGCACGTGCCGGCCACCAGCGACGGTCACCCGATGGACCTGATCCCGC 

>TNGLDTHANVHLHGGHVPATSDGHPMDLI P 
23166 CGGGCGGCTCGAAGGTCTACGACTACCCGAACCTTCAGCGCGGCGCGACGCTCTGGTACCACGACCACACCCACGCCTACGAGGCCGACCAC 

>P GGSKVYDYPNLQRGATLWYHDHTHAYEADH 
232 5 8 GTCTACCGCGGACTGCACGGCTTCTATCTGATCGACGACCCGGCCGAGCATCACCTGCGCCTGCCCGCCGGCAAGTACGACGTGCCGATCAT 

>VYRGLHGFYLIDDPAEHHLRLPAGKYDVPIM
2335 0 GCTGCGCAACGCCCAGTTCGACGACTCCGGCGCCCTCGTCTTCGGCCACCCGGACGACCGGGTCACCATCCTGGCGAACGGCAAGGCCCAGC 

> LRNAQFDD SGALVFGHPDDRVT I LANG K A Q 
23442 CCTACTTCGAGGTGGCCCCGCGCAGGTACCGGTTCCGCCTGCTCAACGCGGCGCTGAAGCACGTCTTCCGGCTCAACCTGGGCGGCGAACCG 

>P YFEVAPRRYRFRLLNAALKHVFRLNLGGEP 
23 534 CTCACCCGCATCGCCACGGACGGCGGGCTGCTGCCCGCCCCCACCAGTCACACCGAGCTGGCGCTCTCCTCCGGGGAGCGGGTCGAGATTGT 

>LTRIATDGGLLPAPTSHTELALSSGERVEIV
23626 GATCGACTTCGCCGAGCACGCAGGCGGCGGGCCGGTCTACCTCTACGACGGGGACAACCCGATCCTGCGCTTCGACGTGTCGTCCCGGGCGG

> IDFAEHAGGGPVYLYDGDNPILRFDVSSRA
23718 TCACCGACCCCAGCCGGGTGCCGGTCACCCTGCGCGCACTGCCCCCGATGGGCACGCCGACCGTGGAGCGCACCGTGTCGATGAGCTTCGAC

>VTDPSRVPVTLRALPPMGTPTVERTVSMSFD
23810 ATGTCGGCCCGGCCCCCGATCGCGCTCATGGACGGCAAACCGTTCGACCCTCTCCGGGTGGACGTACAGGTCAAGCGGGGCAGCACCGAGAT

>MSARPPIALMDGKPFDPLRVDVQVKRGSTEI
23902 CTGGAACGTGGTCAACGCGGATACCGATCCGTTCCCCTTCGACCATCCGTTCCACCTGCACCTGGTGACGTTCCGGGTGCTCGGCCGCGACG

>WNVVNADTDPFPFDHPFHLHLVTFRVLGRD

23994 GCGGGCCGCCCGCGCCGGAGGACGCCGGGCTCAAGGACACCGTCTACGTCTCGCCCAAGGGGTCTGTCAAGATCCAGGTCACCTTCGCCACG

>GGPPAPEDAGLKDTVYVSPKGSVKIQVTFAT

24086 CCGTACCTCGGGCAGTACGTCTACCACTGCCACTACCTGGAGCACTCGTCGCTGGGGATGATGGCCCAGCTGGAGGTTGTGCCCTGAGGGC

>PYLGQYVYHCHYLEHSSLGMMAQLEVVP*
24177 TCAGCCGTGCAGGTCGACGATCGAGGGGTGGGCGCCGAACAGGCTGACCGGCCGCACGTCGCCCACCCCGAACCCGGCGGCGCGGGCCAGG 

<*GHLDVIS PHAGFLSVPRVDGVGFGAARAL 
24268 TCGGCCTGGTCGGCGAACTCGTGCAGCAGCAGTACCGCCCGCCCGCCGTCGACCGTCACCCGGCGCAGCTCCGCGAAGAGGCGGCCGGAATC

<D A Q D A FEH L LLVARGGDVTV RR LEA F L R G S D 
24360 ACCGGCGAGCACGCCCCGCGCCTGCACCTGCCGGTCCCAGGGCGGATTGCTGACCACCCGGTCCACCCGACCGGTCCGCAGCGGCAATCGTC 

<GALVGRAQ VQRDWPPNSVVRDV RGTRL PL RG 
244 52 CGGCGTCGGCGACCGCCCAGGTGACGCGGGCCCCCGACGCCGCCGAGTTGGCGACGGCCGCGCCGACCGTCTCCGGGTCGTGGTCCGAGCCG 

< ADAVAWTVRAGSAASNAVAAGVTEPDHDSG 
2454 4 AACAGCACCGCCCCCGGTGCCAGCCCGGCTGCCTCCACGGGGATCGTGCCGGTGCCGCAGCACGGATCGGCCACCAGCATCCCGGGGCGGAT 

<F LVAGPALGAAEVPITGTGCCPDAVLMGPRI 
24 636 GCCGGCCAGCCAGGCCAGCGCCGCGGCGAGCGGCGGATGCAGGGTCCCCGGCGTGGACGACCGCTTGTAGGCCCGCCGGTGCAGCGGCCGGT 

<GALWALAAALP PHLTG PT S S RKYARRHLP RD 
24 72 8 CGGCCACCCGTACCGCCAGCGTCGCCTGGGTGCCCTCGACGGTGACCCGCAGCGACAGCCCGCCCTCCGGCGGGGCCTCGCCGCCGCGGCGG 

< AVRVALTA QTG EVTV RLS LGGE PPAEGGRR 
24820 GAGTGGTAGCGCAACCCGAGCGCGGCCACCGCGTGCCGCCCCACGGCGTCCTCGATGTCGTACCGGTTGTAGTTGCGGCGGCCGAGGAAGGA

<S HYRLGLAAVAHRGVADE I DYRNYNRRGLFS 
24 912 GGCGGCGACGTCCACGGTCGCCGGCCGCCCGGGCACGCCGCAGGCGGCGCGGGCCGGCAGCACCGCCGGCAGCGCCGCCGCCCGGGCCAGCC 

<AAVDVTA P RGPVGCAARA P L V A PLAAARALR 
25004 GGGTGAAGGCCGCCAGGTCCGCCTTGGTGTGGCCGACGCCGTCGGCGACGGCGACGAGCAGGAACAGGTCGTCGACGGTACGCAGATCCAGC 

<TFAALDAKTHGVGDAVAVLLFLDDVTRLDL
25096 AGGCGCGGCTCCGCGCTGGCGGCGGAGAACCACACCTCGCGGTGCCGGCGGTGCTCGACCCGGCCGAGGCCCCGCTCCTCGATCTCCTGGGC

<L RPEASAAS FWVERHRRHEVRGLGREE IEQA 
25188 GGCCACCTCCTCCAGCCCTCGCAGGGTCCGTGCCATGAAGCGCACCGTCACCGTCCGCCGATCCTCTCCCCGCCGGGTCGTCCCGCCGCC 

<AVEELGRLTRAM 
25278 GTGCCGGCGGCGTCCGCGAGCCTACCCACCGCACCCGCCGCCGGGACGCAGCCGACGCCGACCGGACCCGCCGACGGGGGAGCCGACGCCGA 
25370 CCGGACCCGCCAAGACTAGGTGAACCTCTATAGGAATTCGCGTGCCCCCTTCATAGGGTCCGAAAGGGGTAATGGAACCGTCCGGCACCGGA
25462 CGGCTCGTTTTCTTCCCCCAATTCCGTCCGTCCGACCTGAGCCGTCGCAGGGAAGGCGAGGCCGAGCAGTCGAGCAGTTGATCGGTCGATG

> M 

255 53 CCGCACGGGCCCGTGCGCCGAAATCGTGGAGATTGCGCAGTGCGTACACCGGATCTGTTCATCGGCGCCGTCGGCGCCTTCGTCCCGCCGAC 

>PHGPVRRNRGDCAVRTPD L F IGAVGAFVP PT 
25645 GGTGAGCGTCGAGTGGGCGATCGACCGCGGTCTTTACTCCCGCGAGCAGGTGGAGCTGCACGAGCTGGCGGGCACGGCCATCGCCGGCGACC

>VSVEWAID RGLYSREQVELHELAGTAIAGD 
2573 7 TGCCCGCGCCGGAGATGGCGCTGCGCGCCGCCCAACAGGCGGTCAAGCGCTGGGGCGGCTCGCCGACGGAGTTCGACCTGCTGCTCTACGCC 

>L PAPEMALRAAQQAVKRWGGSPTEFDLLLYA 
2582 9 AGCACCTGGCACCAGGGGCCCGACGGCTGGCCGCCGCACTCCTATCTCCAGCGGCACCTGGTCGGCGGCGACCTGCTGGCGTTGGAGATCCG 

>STWHQGPDGWPPHSYLQRHLVGGDLLALEIR 
2 5 921 GCAGGGCTGCAACGGGATGTTCAGCGCGTTCGAGCTGGCCGCCAGCCACCTCCAGGCGGTACCCGAGCGCACCAGCGCCCTGCTGGTCGCCG 

>QGCNGMFSAFELAASHLQA VPERTSALLVA 
26013 CCGACAACTACGGCACCCCGATGGTCGACCGCTGGCGGATGGGCCCCGGCTTCATCGGTGGCGATGCCGGCAGCGCCCTCATCCTCACCAAG 

>A DNYGTPMVDRWRMGPGF I GGDAGSAL I LTK 
26105 CGACCCGGCTTCGCGCGGCTCCGCTCGGTCTGCACCAAGTCGGTCCCGGAGGCCGAGCGGCTGCACCGGGGCGACGAGCCGCTGTTCCCCCC 

>RPGFARLRSVCTKSVPEAERLHRGDEPLFPP 
26197 GAGCGTCCTGACCGGCCGGGAGCTGAACTTCACCGCCCGGATCGACCAACAGTTCGCCGCCCGCAGCCCCGCCTCGATCGCCATGGCGGACG 

> SVLTGRELNFTARIDQQFAARS PAS IAMAD 
26289 TCGGCGACCACATCGAGGAGGTCGTGGGGCGCGCCCTCGCCGAGGCGGAGATCGAGGTCGGCGACCTCGCCAGGGTCGCCTTCATGAACTTT

>V GDHI EEVVGRALAEAEIEVGDLARVAFMNF 
26381 TCCCGGGAGATCATGGAGCAGCGCTGCCTGGCCAACTGGGGCCTGCCCATGAGCCGGTCCACCTTCGACTTCGGTCGCCGGATCGGGCACTG

>SREIMEQRCLANWGLPMSRSTFDFGRRIGHC 
2 64 7 3 CGGGGCGAGCGACCCCTTGCTGGCCCTGGAACACCTGGCCAGGACGGGGGGCCTCGGCCCCGGCGATCACCTGCTGACCCTCGGCACCGCGC 

>GASDPLLALEHLARTGGLGPGDHLLTLGTA 
2 6 5 6 5 CGGGCGTGGTGGTGTCGTGCGCGATCGTCCAGGTGATCGAGTCGCCGACGTGGCGGGAGTGACCCGCTGGACACCTGCGGCGGCCCGCCCA 

>PGVVVSCAIVQVIES PTWRE© 
2 6 6 5 6 GCCCAGCAAACCGACAGCAGGGGATGATTGTGGAAGCAGAGAAGGACCGGTTGCGTCCGGTGGCGTCCGAGGCGGTCGCCGTGGTGGGGA 

>VEAEKDRLR PVAS E A V A V V G 
26 746 TCGGCTGCCGGTTCCCGGGCGACGTCAACTCGCCCGACGAGTTCTGGGACCTGCTCACCGGGGGTCGCAACACCACCGGGACGGTGCCCGAG 

>IGCRFPGDVNSPDEFWDLLTGGRNTTGTVPE 
26838 GAGCGCTGGAGCGCGTACCGCGACCTGGGTCCGGCGTTCGAGTCCGCGCTCCGGTCGGCCACCCGGGCGGGCAACTTCCTGGCCGACATCTC

>ERWSAYRDLG PAFESALRS ATRAGNFLAD I S 
2 6 93 0 CGGCTTCGACGCGGACTTCTTCGGCATCTCCCCGCGCGAGGCCGAGCTGATGGACCCGCAGCAGCGGCTCATGCTGGAGGTGACCTGGCAGG 

>GFDADFFGISPREAELMDPQQRLMLEVTWQ
2 70 2 2 CGCTGGAGGACGCCGGGATCCCGCCCCGCACCCTGGCCGGCACCGACGTCGGCGTCTTCGCCGGCGTGTGCACCTACGACTACGGCGGCCAC 

>A L E D A G I PPRTLAGTDVGVFAGVCTYDYGGH 
2 7114 CAGTTGGAGGACCTGCCGCACATCGACGCCTGGACGGGCATCGGCGCCGCCACCTGCGCCGTCGCCAACCGGGTCTCCCACGTGCTCGACCT 

>QLEDLPH I DAWTG IGAATCAVANRVSHVLDL 
2 72 0 6 GCGCGGGCCGAGCCTGTCGATCGACACCGCCTGCTCGGCGTCGCTGGTCGCGTTGCACCTCGCCGCGCAGAGCCTGCGGCTGGGCGAGAGCA 

>RGPSLS I DTACSASLVALHLAAQS LRLGES 
2 72 98 CGCTGGCCCTCGCCGGCGGGGTCAACCTGATCGTCACGCCCGGGCAGTCGATCACCCTCGGCTCGGCCGGTGCCCTGGCACCCGACGGGCGC 

>T LALAGGVNLIVTPGQSITLGSAGALAPDGR 
2 7390 AGCAAGTCCTTCGACGCCACCGCCGACGGCTACGGTCGTGGCGAGGGGTGCGGCGTCCTCGTGCTCAAGCTGCTCTCCGACGCCCAGCGGGA 

>SKSFDATADGYGRGEGCGVLVLKLLSDAQRD
274 8 2 CGGGGACCGGGTGCTGGCCGTGCTGCGGGGCAGCGCCGTCAACCAGGACGGCCGGACCAACGGGATCATGGCACCGTGCGGCCAGGCCCAGG 

> GDRVLAVLRGSAVNQDGRTNG IMAPCGQAQ 

2 7 574 AGCACGTGATGGTCCGCGCCCTGCGCTCGGCCGGCATCGAGGCCGGCAGCGTCGACTACATCGAGGCGCACGGCACCGGCACCCCGCTCGGT 

>E HVMVRALRSAGI EAGSVDYI EAHGTGTPLG 
2 766 6 GACCCGATGGAGGCCGCGGCGATCGGCTCGGTCTACGGGCAGGACCGCCCGGACGACGAGCCCTGCCTGATCGGTTCGGTCAAGTCCAACAT 

>DPMEAAA I GSVYGQDRPDD E PCLI GSVKSNI 
2 7 75 8 CGGCCACCTGGAGGGCGCGGCCGGCGTCGCAGGCGTCATCAAGGCGGTCCTGGCGCTGAACCGGGCCGAGGTGCCCGCCACCCTGCTGGTCA 

>GHLEGAAGVAGVI KAVLALNRAEVPATLLV 
2 7850 CCGAGGTCAACCCGGACATCGAGTGGAAGCGGCTGCGGCTGCGCCTGGTCACCCGCAACCAGCCCTGGCCGGACCGGCCGGGGCCGCGCCGC 

>T EVNPDIEWKRLRLRLVTRNQPWPDRPGPRR
2 7 942 GCCGGAGTCTCCGGCTTCGGCTACGGCGGCACCGTGGCGCACGTGGTGCTGGAACAGGCCCCGCCGGTCGCCGCCGAGCCGGCCCCGGCGCT 

>AGVS G FG YGGTVAHVVLEQA P PVAAE PA P AL 
2 8034 GACCGGCGAGACGCTGTTCCCGATCTCCGCCGGCTCCGCGCACTCCCTTCGCGAGCGGGCCCGCGCCCTGGCCGGGATCGTCCCGGATGTCG 

> TGETLFP I SAGSAHSLRERARALAG IVPDV 

2 812 6 ACCTCGCCGCGCTCGGGCACACCCTGGCTCGGCGGCGTTCGCACCTGACCCACCGGGCGGTGGCGGTGGCCGCCGGCCGGGACGACCTGGTC 
>D LAALGHT LARRRSHLTHRAVAVAAGRDDLV 

2 8218 GCGGCGTTCGCGGCGCTCGCCGACGACAGGCCGCACGACCGGGTGCGTACCGGAAGCCCGGTGGCGGAGCCGCCCCGCACGGTGTGGGTGTT 
>AAFAALADDR PHDRVRTGS PVAEP PRTVWVF 

2 8310 CTCCGGGCACGGGTCGCAGTGGACGGGCATGGGGCGGGAACTGCTGGCCACGGAGCCGGCCTTCGCGGACGCGATCGACCGCATCGAGCAGA 

> SGHGSQWTGMGREL LATE PAFADAI DRI EQ 

2 84 02 TCTTCCTCGACGAGATCGGTTTCTCACCCCGCCAGGCGATCCTCGACGGCGACTACGAGGCCGTCGACCGGACCCAGACAATGATCTTCGCG 
>I FLDEIGFS PRQAILDGDYEAVDRTQTMI FA 

28494 ATGCAGCTCGGCCTGGCCGAGATGTGGCGAGCGAGGGGAGTCGAGCCGGACGCGGTGATCGGCCACTCGGTCGGAGAGATCGCCGCGGCGGT
>MQLGLAEMWRARGVEPDAVIGHSVGEIAAAV 

2 8 58 6 GACCGCCGGCATCCTGACCGTGGCCGACGGCGCACGGCTGATCTGCCGTCGTTCCCTGCTGCTGCGCGAGGTCGCCGGCCAGGGCGCGATGG 

> TAGI LTVADGARLICRRSLLLREVAGQGAM 

2 8 6 76 CCCTGGTGACGCTGCCCTTCGAGGAGGTCGCGGCCAGGCTGGCCGGCCGCGTCGACGTGGTCGCCGCGATCGCCTCCTCCCCCTCGTCGACC 
>A LVTLPFE EVAARLAGRVDVVAAIASSPSST 

2 8 770 GTGGTCTCCGGCGACCCGGCCGCGCTGGACGCGCTGGTCGCCGAGTGGACCGAGGAGGGCCTGGGCGTACGCCGGGTCGCCTCCGACGTGGC 
>VVS GD PAA LDALVAEWTEE G LGVR RV ASDVA 

2 8 862 CTTCCACAGCCCGCACATGGATCCGCTGCTCGACCGGCTGCGCGCCGCCGTCGACTTCACCGCCCGCGCACCCCGGGTGCCGATCTACAGCA 

> FHS PHMD PLLDRLRAAVDFTARAPRVPIYS 

2 8 954 CGGCGCTGGCCGACCCGCGGGCCCCGATCACCGCCGACGGCGAGTACTGGGCCGCGAATCTGCGCAACCCGGTCCGGCTCGCCGCAGCGGTG 
>T ALADPRAPITADGEYWAANLRNPVRLAAAV

2 9046 GCCGCCGCCGTCTCCGACGGACACCGGGCCTTCATCGAGGTCTCCCCGCACCCGGTGGTGACCCACTCGATCCACGAGACGCTGGCCGGAAG 
>AAAVSDGHRAFIEVSPHPVVTHSIHETLAGS 

2 913 8 CCTCGACGACGAGGTCTTCGTCGGCGGCACCCTGCGCCGCGACACCCCGGAGGCGCAGGCCTTCCTGTCCAGCCTGGGGGCCGCGCACTGCC 

> LDDEVFVGGTLRRDTPEAQAFLSSLGAAHC 

2 923 0 ACGGGGTCGCGGTCGACTGGGGCCGGGTGCATCCGTCCGGGCCGCTGGTCACCCTGCCCGGCTACCCCTGGCGGCACCGGAGTCACTGGCAC 

>H GVAVDWGRVHPSGPLVTLPGYPWRHRSHWH 
2 93 22 TGGCCGACGCCGGCCGCCGCCACGGGCCGGGGCCACGACCCCGCGTCGCACACCCTGCTCGGCGCGGTCGACAACGTGGCGGGCAGCGACGT 

>WPTPAAATGRGHDPASHTLLGAVDNVAGSDV 
2 9414 GCGGGTGTGGCGCACCGCACTCGACGACGCCAGCCGCCCGTACCCGGGCAGCCACGCCCTCAACGGCGTGGAGATCGTTCCGGCGGCCGTGC 

>RVWRTALDDASRPYPGSHALNGVEIVPAAV 
2 9506 TGGTGGAGACCCTCATGGCTGCCGCCGGGCGGGGCGACGGCCGGCCGCTGCTGACCGGCTTGTCGATGCGGTACCCGCTGATGACCGCCGGG 

>L VETLMAAAGRGDGRPLLTGLSMRYPLMTAG
2 95 98 CTGCACGAGGTCCAGGTGGTGCGGGACGGTGCCGAGGTGCGGCTGGCGTCCCGTTCCGTCGACGCGGAGGCCGACCCGAGCCGGGACTGGCT 

>LHEVQVVRDGAEVRLASRSVDAEADPSRDWL
2 96 90 GATCCACACCGACGCCACGGTGGCCGACGCCGACGCCACGGTGCTCGCCGCGCGGGCGTTGGCCGACCCCGACGACCACCGGATGGAACCGG 

> I HTDATVADADATVLAARALAD PDDHRME P 

29782 GCGACCCGGGCTCCATCCACCGCCGGCTCGCCGAGGTCGGGGTGCCGTCGACGGGATTCGACTGGTCGGTGGAGGAGCTGCTCTCCGGGTAC
>G DPGSIHRRLAEVGVPSTGFDWSVEELLSGY
29874 GGCGTGCTCCGCGCGCGGGTGCGCTCGGCCGACTCGTCCACCTGGGCGCCGGTGCTGGACGCCGTCATGTCGGTCGCCCCCGCCGTCTTCCC
>GVLRARVRSADSSTWAPVLDAVMSVAPAVFP 

2 9 96 6 CGGCGTGCCGCAGCTACGCATGGTGGTGTACGTCGACGAGGTGCTGCTCACCGGCGAGCCACCGGAGGTGACGCTGATCGAGGTCGCCCTCG 

>GVPQLRMVVYVDEVLLTGE P PEVTLI EVAL 

3 00 5 8 ACCCAGACCGGCCCGACACGGCGAACGCGCTGGTCGCGGATGCTCAGGGCCGGGTCGTGGCCAGCCTTCCCGGGCTGCGCTACCCGGTGATC 

>D PDRPDTANALVADAQGRVVAS LPGLRYPVI 
3 015 0 GACCAGCCGGTCGCCCCGGCGCAGGACAGTTCCGGCGAGGTGGAGGAGGCGGTCTCCTTCGCCGGCCTGTCCGACGAGGAACTGCACGAGCG 

>DQPVAPAQDSSGEVEEAVSFAGLSDEELHER 
30242 GGTGTTCGACGAGGTGCGCCGGCAGATCGCCGGGGAGATGCGACTCGACGCCGACGACCTGCATCCCCGCCGGCCGCTGGCCGAGCAGGGCC

>VFDEVRRQ IAGEMRLDADDLH PRRPLAEQG 
3 0 3 3 4 TCGACTCGGTGATGACGGTGGTGATCCGGCGACGCCTGGAGAAGCGCACCGGGCGGAGCCTCTCCCCGACCGTCTTCTGGCAGCGGCCCACC 

>L DSVMTVVIRRRLEKRTGRSLSPTVFWQRPT 
3 04 2 6 GTCGCCGCCATCGCCGACCACCTGGTGGAGCTGTTGAGCACCCCGCAGGAGTGAGGATCCGCGACGACAGGGAGGGCCCGTGCGTTCCGCA 

>VAAIADHLVELLSTPQEo 

3 0 517 CGGGCCCTCCCTGCTGTCGACGGATCTTCAGGTGGCGGGGTCAGCCCGGCCGCTGGTCCACCGCAGGGGTGGCGGCCCACTCCAGGTGGCG 

<oGPRQDVAPTAAWELHR 
3 06 0 8 GCTCTCCCGCAGGGTCTCCTCGGCGTTCTGTTCGACCCGCCGCATGGCGACGCGCAGCATCGGCGGCGCCATGACCGAGGTGACGACCGCGA 

< S ERLTE EANQEVRRMAVR L M P PAMVSTVVAV 
30700 CCAGCACCACGATGGTGTATGAGGCGGTGTTCAGCACGCCCAGGCGCAGCCCGACCATCGCGATGATGATCTCCACGGCACCGCGCGCGTTG

< LVVITYSATNLVGLRLGVMAI I IEVAGRAN 

30792 AGCCCGGCGCCGAGGGCGACGCCCTCCCAGTGGCTCTGCCGGGCCAGCCGTGCACCCAGGTACGCGCCGGTGTACTTGCCGAGCACGGCGAG
<LGAGLAVGEWHSQRALRAGLYAGTYKGLVAL

30884 CGCCAGGATCACCGCCCCGGCGGCGAGCACCTCCGGGTCGGCGAGGGCACGCAGATCGACCCGGAGCCCCGCGCTGGCCAGGAAGATCGGGG
<ALIVAGAALVEPDALARLDVRLGASALFIPA 

3 0 9 7 6 CGAGCACGGACAGCACCACCGTGCGCAGCGGTGCCAGCCGGGCCGGCTCCCCGTTGCCGGGAAGGCCGATCAGGACACCCGCCACCAGCGCG 

< LVSLVVTRLPALRAPEGNGPLGILVGAVLA 
3106 8 CCGAAGATCGCCTCCAGGCCCAGCGCGTGCGCCCCCGCCGCGAAGGCCAGCACGATCACCACGGCGACGGCGCTGGCGGCCCCGCCGTCGGG 

<G FIAELGLAHAGAAFALVI VVAVASAAGGDP 
3116 0 CTGGGCGTTCGCCCAGCGCATGGCGCGCCGGGTCACCGGCCGGCCGACCAGGACGGCCACCGCGAGGTAGGCCACGAGGTAAAGCAGGGCGG 
<QANAWRMARRTVPRGVLVAVALYAVLYLLAT 

31252 TCACCACCTGCCCGGCGGTGAGGGTGCTCACCGCCACCGACGAGATGAGCGACAGCAGGAACCAGGCCGCAGCGTCCTCCAGCGACGCCGCA

< VVQGATLT SVAVSS I LSL LFWAAADELSAA 

31344 GCCAGGATGATCTGCCCCACGTCGCGGTGCAGCAGGCGCATGTCGGTGAGCGTCTTCGCGATCACCGGCACGGCGCTGACCGCCATCGCCAC

<ALI I QGVDRHLLRMDTLTKA I VPVASVAMAV 

314 3 6 GCCGACGAACAGCGCGAAGACAGTCCGCTCCTGCCCGGCCGCGAGAAGCGCCGCCGGGGCCAGCAGGCCGGCGGCGATGCCCAGGCCGAGGG 

<GVFLAFVT REQGAALLAA PALLGAAIGLGLP 
31528 GCACCGCGAGGCCGCCGACGGCCACCGCGCCGATCGTGCCGGCCCGCCGCCGGACCAGCCGCAGGTCCAGGTGCAGACCGGCCACGGCGACC

< VALGGVAVAGITGARRRVLRLDLHLGAVAV
3162 0 AGCAGTACGACGCCGAACTGACCGATGGCGTCGAGCAGGTGGACCTGGTCGGGGTCGGCGGGCAGCAGCCACCGTCCGATGTCGGGTGCCAG 

<L LVVGFQGIADLLHVQDPDAPLLWRGIDPAL 
31712 GGCCCCCAGCACCGAGGGGCCGAGCAGCACCCCGGTCAGCAGCTCACCGACCACCGCCGGCAGGCCGAAGCGTTGCGCCACCCGTCCCAGGA 

<AGLVSPGLLVGTLLEGVVAPLGFRQAVRGLV 
318 04 CGACGGCGAGCAGCAGCAGCAGGCCCACCTGGAGCAGGAACAGCAGTAGCTGGTGGGAGCCCAGCGGGGGCACCGGCGCGGCCACGATCA 

< VALLLLLGVQLLFLLLQHSGLPPV 

318 94 CGGTCGTTGTTCCTTTCGTCTCGACGCCCGGCCGGGCCCGCCGGTGGTCAGGCCGCGATCTCGGGCGGCAGGTCCACCGGGTCGGCCGAGT 

<°AAIEPPLDVPDASN 

319 8 5 TCATGAAGGTCCGCAGGGCCCGCAGGTCGGCGGCGGGCTTCTCCCAGCCCAGCCCGGCGCACCAATCGATGAGGGAGGTCAGCTCCCCGGTG 

< MFTRLARLDAAPKEWGLGACWD ILSTLEGT 

3 2077 GCCCGAACCTTGTCCTCCTCGGGCAGGCTGAGCACCGACATCTCGGCCGGCCACTGCACCACGTTGGCCAGGTCCACGTCGAGCCCCTCGGC 

<A RVKDEEPLSLVSMEAPWQVVNALDVDLGEA 
3216 9 CCGGGCGAACTCCAGCAGGTCGCGCAGCCCCCAGACGTTGTCCCGCTGGGGTGCCACCTGGAGCCAGAGGTTGACCTCCGAGCGGGCCCGGC 

< R A F E LLD R LG WVNDRQ PAVQ LWLNVE S RARR 
3 22 61 GGACGTTCGCGATGAAGGTCTCCCACTTCGCGCCCTGCCGGATCCGCTCGAACACCTCGCCGTAGCCGTCGCAGGAGGCGCCGATGCCGATG 

< VNAI FTEWKAGQRIREFVEGYGDCSAGIG I 

3 2 3 53 CTCTTGAAGTGCCGGAACCGGTCGAAGACCGACTCCGGCAACACGGTGAGGTTGGAGTTGTAGACGACGTCGACGTTGCCGGCGTTACCCGT 
<S KFHRFRDFVSEPLVTLNSNYVVDVNGANGT 

32445 CTCCACCAGCAGGTCGAGCAGGGCGAAGTGGCCCGGCTGCATGAACGGCTCCCCACCGGCGAAGTACAGCCGCCGGATGAGGTGGGCGTTCT
<EVLLDLLAFHGPQMFPEGGAFYLRRILHANE

3 2 53 7 CGCGCAGGGTCTGCCACAGCTCGTCGTCGTCCCGGTAGGCGTCGATGACCGCGGACGACCAGGCCGGCCGCTTCTTGGCGCCCCAGCCCGAG 

< RLTQWLEDDDRYADIVAS SWAPRKKAGWGS 

3 262 9 CTGACCGGGTACGCGCACATCACGCACCGCAGGTTGCAGGTGTTGCCGAACCGGATGTCGAGGAAGAACGGGAAGTCCTCGACGGTGCCGTC 
<S VPYACMVCRLNCTNGFRI DLFFPFDEVTGD 

32721 GGGCGCGGTGCGGGCGGCCAGCCGGTCGGGGTCGGCGATGTCCCGGAACCGCTGGTTGATCTCCTGCCGGTACGACAGCGCGCCGTGGTCCT
<PATRAALRDPDAIDRFRQNIEQRYSLAGHDE

3 2 813 CGCGGTGGTAGCAGTAGGAGCAGGCGTCGACCCGCTCCCCGGCCAGCATCGCCAGCCGGGTCCGGCGCATGTTGGGGCTGTTGAAGGCGTC 

< RHYCYSCADVREGALMALRTRRM 

3 2 904 CGCCAGGCCCATCACCCGGCCGGGGTTGTCCGCCGCGTAGCGGGACCGCGGCGAGCAGCCGACGGCGTCGTCGTTGAGCAGGAACTCCGGCT 
3 2 9 96 CCTCCTCCTGCTCGTACAGCTTGTTGTGGTACATCGAGTCGTCCACGCAGCACCGCCCGTAGACACCGTCGATGGACGCGCAGAGATGGATC 
3 3 0 8 8 CAGGGCAGCACGCACATGGTCCGGGACCGGGCTTCCGGGTCAGTCATGAAAGTTGATCACCTCGGTGGTGGGGGCGGTGTCATCCCGGTCG 

< • G P R 

3 317 9 GCCGACCGTCTCGACCCGGGGCAGCGGAAAGATCAACCGGGTGCCGCTGGCCAGCATCTCCGCCTCCCGGGCGACGATCTCGTCCCGGAAGT 
<GVTEVRPL P FI LRTGSALMEAERAVI EDRFH 
33271 GCCAGGGCAGGACCAGGTAGTAGTCCGGGCGGGCCGCCCGCGACTCCTGCTCGCTGATGATTTCGATGTCCGTGCCGAGGGTGCGCGCGCCC

< WPLVLY YDPRAARSEQES I I E IDTGLTRAG 

3 33 6 3 ACCTTGTCCGGATTGCGCTCCGCGGCGTACCGGATGAGCTCGCGGTCTATGCCGCAGAACTGCAGGAGGGTGTTGCCCTTCGTCGACGCGCC 
<V KDPNREAAYRI LERD IGCFQLLTNGKTSAG 

3 34 5 5 GTAGACGTGCACCGTGCGGCCCTGGCCCCGCAGCTCGCGCAGCAGGGCGCTCACCTCGTCACGGTGCTGGCGCACCTGCTCGGCGAAGCGCT 
<YVHVTRGQGRLERLLASVEDRHQRVQEAFRQ 
33547 GGTACGGGGCGTCGCCGTCCAGCCCCAGCGCCAGCTCCCGGTCGGCGAGTGCCTTGACCGAACCGTCGGCCCGGCCGCCCACCTCACCGGCC
<YPADGDLGLALERDALAKVSGDARGGVEGA

33 63 9 CGGGTGACCACGCAGCAGATCGAGCCGCCGTTCACCCCGTTGAGGCTGGCACGGACGATCTCCAGGCCCGCCGCGCCCAGAATGCGGCTCAG 
<R TVVCCI SGGNVGNLSARVI ELGAAGLIRSL 

33 7 31 CGTGGCCAGCGAGTAGTAGGACAGGTGCTCGTGGCAGATGCTGTCGTAGCCGGCGATCTCCAGCATCGCCGGCAGGTAGGCGACCTCGACCA 
<TALSYYSLHEHCISDYGAIELMAPLYAVEVV 

33 823 CCCAGACCCCGCCCGGGGCGAGCAGCGCCTCGACCTGACGGGCGAACTCCACCGGGTCCTCGACGTCGTAGAACATCGCGATCGAGGTGACC 

<WVGGPALLAEVQRAFEVPDEVDYFMAISTV 
3 3 915 AGGTCGAAGCTGCCCGCGTGCGGGACCAGCTCGGGGCTGGGGAAGAAGTCGCGGATCAGATTGAAGTCGTCGGGCGCGTCGTCGGCGGCGCT 
<L DFSGAH PVLEPS PFFDRI LNFDDPADDAAS 

34 007 GGAGGGGTCGATGCCCCACCGCTGCGCGTCGGTCAGGTTGCCCAGCAGGGTGCCGTCGTTGCAGCCGATGTCGAGCACCTTGCCGGGCCGCT 

< S PDIGWRQADTLNGLLTGDNCG IDLVKG P RE 
34 0 9 9 CCCCCAACACCTCGACCGCCGCGTCCACGACGTCGGCGAGATGCCGGCGCATCGTGTCGTTGATCCGCGAGCGGTACCAGTAGGTGTCGTAG 

<GLVEVAADVVDALHRRMTDN I RSRYWYTDY 
34191 AGCAGCCCACCCGGCAGGGTGTGCCGCAACTGCACCAGGCCGCACGGGTCACCGCCGTCGCGTTCCGCGCACCGGGTCAGCTCCAGGGGGAA 

<L LGGPLTHRLQVLGCPDGGDREACRTLELPF 
34283 GCGCACCCTGGGCGGGTCCGACACGCCGGGCTTCACGAAGCTGCCCTGTAGGTACTGGGCACCGAGGTCGAGGACCGTACGCAGGGTCCCGC

<RVRPPDSVGPKVFSGQLYQAGLDLVTRLTGG 
34 3 75 CACACACCCGGCAGGTGGTCCGCTCGACCACCTCCGACGCCAACTCCTCGCCGGCCGCCGCCAACTGGCTCACGTCAGGTCCTCCTCGT 

<CVRCTTREVV < » T R R T 

344 64 CTCGTGCCGGTGTGCGGGACCAGGCGCCCCGTCGTGCGGTTTGCTGGCGACCAGCAGGATGTCCAGATAGAAGGGCTGGTCGGGGCCCTCGC 

<EHRHAPG PAGDHPKSAVLL I DLYF PQDPG ES 
34 556 TGCGGCCGAGATGCCGGAACGCCCGGTCGAGGTACTCGTCGAGCGCGCGGGGCCGCAGCCGGTCCACCAGCCACAGGGCGCGCAGGGCCAGA 

<RGLHRFARDLYEDLARPRLRDVLWLARLAL 
34648 CCCACCGGGCCGCCGGACGGCCAGCCGTGCTCGCGGCCGTACCAGCGCAGCAGCAGCAACAGGCCGCGCGGCCCGCAGGTCAGCTTGACGGT

<G VPGGSPWGHERGYWRLLLLLGRPGCTLKVT
34740 GCGGTCGACGGTGAAGCCGGCCCACTCGGCCTGGCGGGCCAGCCCGTCGGCGGTCCACCGCCACAGGTCCTGGCCGCCGTGCTCCTCCCACA

<RDVTFGAWEAQRALGDATWRWLDQGGHEEWV
34 8 3 2 CCCCGTGGGTGGAGAGCACCAGCCGGCCGCCCGGGCGCAGCAGCCGGTACGCCTCCCGCAGGTACGCGTCCGCGTCCGAGACGTGTTCGAGC 

<GHTSLVLRGGPRLLRYA ERLYADADSVHEL 
34924 ACCTGGGTGGAGAGCACCCCGTCGAACGTCCCGTCCGGCACCGGGCAGCGCCCGTCCCGGTCGAGGGCGTGGTCGGCCGGCAGGGACTCACC

<V QTSLVGDFTGDPVPCRGDRDLAHDAPLSEG
3 5 016 GCCCGGGATGTCGGCGGTCTGCAACTCGGCGGAGCGGAACAGGCCCCGGTATGGCGAGGTGCCCGCGCCGTAGTCGAGCCAGACTCCCGTGG 

<G P I DATQL EASRFLGRY P S TGAGYDLWVG TA 
35108 CGTCCCGGACCGCCTCCGCCAGCGCGTCGCGCAGGTCCAGGAAGTGGGCGTATGCCCAGTCCCCGGGCCGGGGCTCGATCCGCTCCCGGAAC

<DRVAEALADRLDLFHAYAWDGPRPEIRERF
3 52 00 CGTTCGGCCATCACCTCGTCCAGCGACGGGCCGGTGGGGGCGGTGCCGGCACCGGTCAGCAGGGGACCAAGCCCGGGTACGCGCCGCAGC 

<R E A M 

3 52 90 AGGGTCACCAACGCCAGTACTCCTCGTGCGGGTACCGCATGGTGGTCGGCTCCGGCAGGCCCTCCCAGCGCCGCAGCGACGCGGCGGGCCG 

<oWRWYEEHPYRMTTPEPLGEWRRLSAAPR 
35381 GTACTCGAAGGGCTGCTCGCCGGCGTGCTCCGGTGCCGCCCGCCGCTGCGCCAGGCAGGAGAGCAGCACCTGCTGCTTGAGCCCGATGTACT

<YE F PQEGAHEPAARRQALC S LLVQQKLG I YE 
3 54 73 CCGGCACGTGCGGGCCGAGGTCCAGCTCGACCTCGGCGGGCCGGTACTCGAAGTAGATGACCCGGCGCCGCTTGCCGGTCACCGCCGGCGCG 

<PVHPGLDLEVEAPRYEFYIVRRRKGTVAPA 
35565 GCGTGCAGCATCAGGATGTTGTGCAGCATCACGTCGCCCGGGTTCATCACCGCCGGCACCGCCCCGGTGGTGTCCCACTCGGTGGCGTTCAT

<A HLMLINHLMVDGPNMVAPVAGTTDWETANM
35657 CCGGGTGGTGGTCTCGTTCGCCCGGTCGGTGTCCCAGTAGTTCGACTGCGGGATGCACCAGACGCAGTTGTCCTCCGGGGCCGGGTCAAGGT

<RTTTENARDTDWYNSQP I CWVCND E PAPD LY 
3 574 9 AGATGCCGACGTCGATCACCCGGCCCGCGCCGGTGATGCCGACCGCGTTCTCCGGGTAGAGGCCGCCGTCGCGGTGCCAGGGCAGCCGGGGC 

< IGVDIVRGAGTIGVANEPYLGGDRHWPLRP 

3 5841 GCCCCCGCCTCGGTCTTGAAGACCATGCTGTCCCAGGTGGGGATGAGGTTGGGGCCGACCAGGTCCTCCATCGCCCGCAGCAGCAGGGGGTG 

<A GAETKFVMSDWTPILNPGVLDEMARLLLPH 
3 5933 GCCGGCGAGCCGGGCGACCACCGGGGACTTGTCGACCACGTACTCGATCCGCACCGGCGCCGCGTCCGGCTCGTGCGGTTCCAGCGTCCAGA 

<GALRAVVPSKDVVYEIRVPAADPEHPELTWI 
36025 TGGTGTCGGTCATCGTCCGGGTGCGCCAGGCTTCGTCGATCAGCTCGTCGGCCGCCGCCTGCACGGACCGCAGCTCGTCGGGGTCCAGCAGC

<TDTMTRTRWAEDILEDAAAQVSRLEDPDLL
36117 CCGCGCAGGATCAGCGCGCCCTGCCGGCGGAAGGCGGTCAGGTGCTCCGGAAGCAGCCCGGTCTCGTGGATGTGGCACTCGGGGACGGCCTG

<GRL I LAGQRRFATLHEPLLGT EH I HCEPVAQ 
3 6 2 09 CTCGGTGCGGACGTCCACAGTCGCGCTCATGGTTCGGTTCCCTTCTGCCAGGCGGACGGTTCGTGCTGCCCGGAGCCGCCGGCCGGGCCCG 

<ETRVDVTASM 

<o PETGKQWAS PEHQGSGGAPGP 
3 63 00 GGCTCGGTCGGTCGGCGACGAAGTACCAGTGCTCCCGCAGCGCGTCGGCGAACCCCGCCCGGTCCAGCGCCGGCTGCCCGGCGCGGCGCCCG 

< S PRDAVFYWHERLADA FGARD LAPQGARRG 
363 92 GGCAGGGTACGCAGCTCGTAACCCAGCTCGGTCACGAGCAGCGCCCACAGGTCGGCGCTGGTGGTGCCGTACTCCCGCATGGCGTGGTCGCC 

<P LTRLEYGLETVLLAWLDASTTGYE RMAHDG 
3 64 84 GCCGTGCTCGAAGACGATCACCGGCCGCCAGCGGCGGAGCAGCTCCACGGCACCGCGCAGGGCGAGCACCTCGCCGCCCTCGGTGTCCACCT 

<GHE FVI V P RWRRLLEVAGRLALVEGGETDVK 
3 6576 TGACCAGGTCGATCCGGCGGTCACCGGGGAGCACGTCGTCCAGGCGGACGGTGTCGACCGTCAGCTCCCGCAGGGTCTCGTCCGGGCGGTCG 

<VLD I RRDG PLVDDLRVTDVT L ERLTEDPRD 
3 6668 TAGGGACGCCGGCGCAGCCCGCTGTAGCCGGGGTTGGAGACCACGTGGACGAAGCTGTCCCGGCCGGTGCGCTCGGCGGCGGCGGCGGCCAC 

<Y PRRRLGS YGPNSVVHVFSDRGTREAAAAAV 
36760 CACCGTCACGCCGGGGAAGTCCCGGCGCAGCCCCTCGGCGTACGACGGCAGCGCCTCGACGGCCACGTGCCGGCCACGGGGGGCGACCCGCA
<VTVGPFDRRLGEAYSPLAEVAVHRGRPAVRL
36852 GCAGGTGACGCAGGATGTCGCCGGCGCCGGCCCCGATGTCCACGGTGTTGGCGTCCGGTTCGCAGATCTGCTCGATCAGCGCCACGGTGAGC
<LHRLIDGAGAGIDVTNADPECIQEILAVTL
36944 TGGTCGTACCAGTCGTTCATCGACAGCGCGCCGGCCTCGTCGGTCGCGGAAAGCTCAGTGGACATCGTCACGCTCCTCGGTCCGGCACGCC
<* HVDDREETRCA

<Q D Y W D N M 
37035 GGTCCGGCCCCGGAGCCGACCGCCGGGGCGGTACGGACCAGGAGTTCCAGCTCCCGCAGCTCGATCTCGGACAGCTCCAGGCCGGCGGCGCG

<P GAGSGVAPATRVLLELERLEIESLELGAAR 
3712 7 GACGTTCTCCTCGACGACTCCGGGCGACTGAGCGCCGAACACCGGCACCACCCCGGCCGGGTGGTGCAGCGCCCAGGCCAGTGCCACCTGCG 

<VNEEVVGPS QAGFVPVVGA PHHLAWALAVQA 
37219 CGACGGTGTGCCCACGCTCGGCGGCGAAGGCGGCGAGGCCGTCGACCACGTCGAGCAGTTGGGCGTAGTCCTCACCCCGGAAGGCGTGCGAG 

<VTHGREAAFAALGDVVDLLQAYDEGRFAHS 
3 7311 TAGGCCCGCCAGTCCTCGGGGGCGAATGCCTGGTCGCGGTGCAGCGCGCCGGTGAGCAGCCCGTGGGCGAGCGCGGAGCCGCCCAGCACCCC 

<Y ARWDEPAFAQDRHLAGTLLGHALASGGLVG 
3 7403 GACGCCGGCCTCCTGGCAGCGGGGCAGCACCTCCTTCTCGGCACCACGGTCGAGCAGGTTGAACGGCACCTGGACGACGTCCAGCAGCCCGG 

<VGAEQCRPLVEKEAGRDLLNFPVQVVDLLGT 
3 7495 TCGGCACCAGCTCGGCCAGGTCGCCCGCCGTCACGTTGGCGAAGCCGACATGGCGGGCCAGGCCCTCGCGCACGAACCCCGCCAGCACCTCG 

< PVLEALDGATVNAFGVHRALGERVFGALVE 

3 75 8 7 GCGGTCTCCGCGAGCGGGACGGTCGGGTCGGGCCAGTGCACGGAGTACACGTCGACGTGGTCCGTGCCGAGCTGACGCAGGCTGGCCAGCAG 

<A TEAL PVT PD PWHVSYVDVHDTGLQRLSALL 
3 76 7 9 CTCGTCCCGGAGGAACGCGGGGTCGCTGTTACGCACCGTCCGTCCGGGCGGGTCGAGCTTGTGCCGTCGCACGCCGGGCCGGGTCTCCAGCC 

<EDRLFAPDSNRVTRGPPDLKHRRVGPRTELG 
37771 CGCCGGCCGTGGCGATCACGATCTCGTCCCGGTGCGCGGGCAACAGGTCGGCCAGCCCGCGGGCAAGGGCCGCCTCGGCGGCCCCGCCGCCG 

<GATAIVI EDRHAPLLDALGRALAAEAAGGG 
37863 TACGCCCGGGAGGTGTCGAAAAGGGTGACGCCCAGGTCGAAGGCCCGGCGGACGGCCTGCACGCCTGGTTCGATCCGCCGGCCCCACTGGCC 

<Y ARSTDFLTVGLDFARRVAQVGPEIRRGWQG
3 795 5 GCCGAGCGCCCAGGTGCCCAGGCCGAGCGCCGAGACCAGCGGCCCCCGCTCACCGATGCAGCGCTGCCGCACGGCGTCCTCCCGCCCGGTG 

<GLAWTGLGLASVLPGREG I CRQRV 
3 8046 CCCCACCCACGCCCGTCGGCCGGCCGTCACGCCGCCGGCTCCCCGTCGCTGCGCAGCGTCAGGAACACCACGCCGTCGTGATTGATCATCT 

<o AAPEGDSRLTLFVVGDHNIMK 
3 813 7 TGTCGGTGACCGGCGTCAGCCCGGCCTCGGCGGCGATGCCGGTCATCAACTCCACGCCGTGCCAGTTCAGGAACCCCTTGAACTCCACGGGG 

<DTVPTLGAEAAIGTMLEVGHWNLFGKFEVP 
3 82 2 9 TTCGCGTCGCGGTACCGCTCGGCGTAGGTGTGGAAGAAGCCGCGCGTGTTGTCGCCGAGGTCCAGGAAGTTGAAGCAGAACAGCCCGCCGGG 

<N ADRYREAYTHFFGRTNDG LDLFNFCFLGGP 
3 8321 CCGCAGGATCCGCCGGATCTGACGGAAGTAGAGGAAGACCTCGAAGACGTTGAGGTGGATGAACACGTTCAGGGAGAACCCGGCGTCGAACG 

<RLIRRI QRFYLFVEFVNLH I FVNLS FGADFA 
3 8413 CGGCGGTCGGCAGCTTCTCCAGGAAGTCGTTCTCGATGTGGTGGTAGGAGACGTTCTCCCGGCCCTCGCAGGTGGCGCGCGCCTTGTCGAGG 

<ATPLKELFDNEIHHYSVNERGECTARAKDL 
38505 AAGGATCGGCTGACGTCGGCGCAGAGCACGGCGCGCACCCGGTCGGCGAGCCCGGCCGCCATGATGCCCTCGCCGCTGCCGATCTCGAAGAT

<F SRSVDACLVARVRDALGAAM 
38 5 97 CTCCGATTCCGGGCCGAGCCCGAGCTGCTCGACGACCAGGGCGACCTTGTCGACACGGTCCTGGAGGTACTCCTCGCGCGGCTGGTAGCCGG 
3 86 8 9 CGAGCTGCATCTGCATCTCCTCCGGCGTGTTCCACTCCCAGACGTAGTTGAGGTCGCCGGTGCTCCGGAGCGGACAGCCCCGGGGGGCGTCC 
38781 TGCGGCGGCGGCGTCGCCTGCTTCGCGTCGTTCATCGGACCTCACTCGCGGTCTTGTCGGGCTCTGCGGGGACAACTGGGCGGGCGGCTGCG 
38 8 73 GAAAACTGGGCGGGCGGTGTCACGGACGCCGGGCGAACTCGGCGACGTCCTCGGGGCTCAGCGCGCCGGTGTCGGCCAGCCGGGCCAGGAG 

<o PRRAFEAVDEPSLAGTDALRALL 
3 8 964 TTCGGCCACCTGGACGGCGGTCGGCCCGGTGGTGACGGACTCCCGCATCCGCTGCGCCGCCACCCGGAATCGGTGGTCGTAGAGGACGGAGC 

<EAVQVATPGTTVSERMRQAAVRFRHDYLVSG 
39056 CGAGGGCCTCGTCGACCTCCTCGCGGGACGCCTTCAGGCCGGGCAGCGTCTTCGTCGCGCCCTGCGGGTCGAGCCGCCGCCCGTAGATCAGG 

< LAEDVEERSAKLGPLTKTAGQPDLRRGYIL 

3 914 8 GCGTCGTAGTTGAGCGCCAGCGACAACTGCGGCACGCCCATGGCGAGCCCGTTCATGTAGCAGTTGGCGCTGCCGTGGTGCACCAGCAAGTC 
<A DYNLALSLQ PVGMALGNMYCNASGHHVLLD 

3 924 0 GCAGTCGGGGAGGATGAGCTCCAGCGGGCAGTTGCTGAGCACCCGCACGTTCGGCGGCAGCGCGCCCAGCCCCTCCACCTCGGAGGAGGCGG 
<CDPLILEL PCNSLVRVNP PLAGLGEV ES SAA 

3 93 32 CGGTGATCACGACCTCCACGCCCCGCTGGGCGGCGGCGTCGACGGCGTGCCGCAGCGCCGGCACCTGCGCGCCGAACACGCCCGTGGCGGAG 

< TI VVEVGRQAAADVAHRLAPVQAGFVGTAS 

3 94 24 TTGCCCCACACCACGCAGACCCGCTTGCCCCGGCGCGGACCGAGCAGCCAGGGGTCCACGTCCTGGGAGCCGTTGTAGGGCTGGTAGCGGAT 

<N GWVVCVRKGRRPGLLWPDVDQSGNYPQYRI 
3 9516 CGGGATCCGCAGCGCGTCGCCCATCGGCGGGATCGCCACGTCGGGCGACGGGTCGATGGCGTACCGGATCTGGTGCCGGCTCCACTCGACGC 

<P IRLADGMPP IAVDPSPD I AYR IQHRSWEVG 
39608 CGTACTTGCGGAACTCGGTCACCGGGTCGCCGGAGACCAGGTCGAGCCCGGGCTCGGTCTCGATGGTGCCGATGAACCCGGGCGAGAAGTAG

<YKRFETVPDGSVLDLGPETEITGIFGPSFY
3 9700 ACGCTGGGGATGTGGTGCAGCTCGGCGACCAGCGCGCCCTCCACGGCCATGATGTCGTGGACCACCAGGTCGGGCCGGTAGTGGGCGGCGTA 

<VS PI HHLEAVLAGEVAMI DHVVLD PRYHAAY 
39792 GTCGACCGCGTTGTCGTAGCTGCGCTGGACCGCGGTGACCGTGCGCTTCCAGTAGTCGGCGAGCAGGTCGGTGTCGAAGTCGGCGAGCGAGT

<DVANDYSRQVATVTRKWYDALLDTDFDALSD
3 98 84 CCATCGGCCGGCCCGTGAAGGGGTTCAACGGCAGCGGTTGCTCCACCATGTGCTGCGGGGTGTAGAGGGCCTGGACGTAGAAGCCCAGCCGG 

<MPRGTFPNLPLPQEVMHQPTYLAQVYFGLR 

39976 GCGCTCTCCATCATGTCGGGTCCGTCGAGCACCGAGACGGGCATCATGCCCGCCGCGGCCACGCCCCGGACCTGGGACGGCGAACAGGCGAC

<A SEMMDPGDLVSVPMMGAAAVGRVQSPSCAV 

4 0 06 8 CTTGACGTCGTGGCCGGCCGCCCGCAGCGCCCAGGCCAGCGGCACCATGCACATGTAGTGCCCGGCCCAGTTGGACACGGTGAACAGAACCT 

<KVDHGAARLAWALPVMCMYHGAWNSVTFLVK 
40160 TCATCGCAGCCTCTCTGTGGCCTGCCGAGGGGAGGTTGGGGGTCGCGGCCGGACGGTCAGGAGGTCAGGACCGGCAACTCCCGCGCGGGA 

<M <oSTLVPLERAP 
4 02 50 TCGGCGAACTCGATCACGATGGTGCGCCGCCACTGGTCGGAGGGGTTCGGCCCGGAGCCGTGCACCAGCCGTACGTCGTGGACGATGAAGTC 

<D AFEIVITRRWQDSPNPGSGHVLRVDHVIFD 
4 034 2 CCCCTCCTGCGAGGGGACCGGCACCCGCGGCCCGGCGTCGCGCACGGCCGTGACGTCGGCGTCGTCGGGCAGCAGGTGGGAGCCGGGCACGC 

<GEQS PVPVR PGADRVATVDADD PLLHSG PVG 
4 0434 CCTCCAGACAGCCGTTCTCCGGCCCGGCGGTGTCCAGGCAGATGCTGATGTTGCAGACCGCGTGCGGGGGGACGTTGACCCGGTCCCGGTGC 

< ELCGNE PGATDLCI S INCVAH P PVNVRDRH 
40526 CACGGCACGCCGGCCGCCCGCAGTGGCTCCTTGAGCACCAACGCGAAGGCGGTGGGCACGACCGGGGTGCCGAGGACGTCGGCGGCCACCGC

<W P V G A A R L PE KLVLAFAT PVVPTGLVDAAVA 
4 0618 GGCGATCTCCGGCCGGTGCAGCAACTCGCCCTGCGGCCAGTCCTGCTTTTCGAGGTTGTGGATCCGGTACAGCACCGGCTCCGCGCCCTCGA 

<AI E PRHLLEGQPWDQKELNH I RYLVPEAGEV 
4 0710 CCTCGTAGTTCCAGTAGTCGGCGTTGGCCCGCGCCGGACCGGCGAACCGGTCGATCAGGCTTACCGCGCCGGCCTTGAGCTGGGCCAGCACC 

< EYNWYDANARAPGAFRDI LSVAGAKLQALV 
40802 TCGGGGTCGAGCACCGGCCCGACGTGCGCGATCCCGTCGCTGCGGAACCGGCTGGCCACGGCCTCGCGCTCCCGCGACATGGTGGTCATCG
<EPDLVPGVHAIGDSRFRSAVAERE RSMTTM 

< o R 

4 0 8 93 GAGGCTCACCCCTTCGGTCGGTCGGCGCGTGCCGGTGCGGCGGCCGACGGCGATGATGTCGCACACCTCCGGCGACCGGCGCTCGTGCAGC 
<LSVGET PRRTGTRRGVAI I DCVEPSRREHL 

4 0 9 8 4 ACCTCCACCTCGGCGAAGCCGGCGTTGTGCAGGGCGGCGAACAGGGACTCCCGGTCGAGCCAGCGCACGTCGACGCTGAGCCCCCGGGCCTG 
<V EVEAFGANHLAAFLSERDLWRVDVSLGRAQ 

410 7 6 CGGCTCGGGGTGCTCCTCGCGGACCTGCTTGACCGTGTAGCCGTCGATCGGTTGCAGGTCGCCCACGCCTCCCCAGTAGTGGGTGGAGAGGT 
<PE PHEERVQKVTYGDI PQL.DGVGGWYHTSLY 

4116 8 AGATTCCCGCCGCGACGCCCGCGATGTCCTTCAGCAGGGTCCACGGCTCACGCACGTGGTAGAGCAGGCCCGCGCAGAGGACGGCGTCGAAC 

< IGAAVGAIDKLLTWPERVHYLLGACLVADF 

412 6 0 TCGCCCAGCTCGGTGAAGTCGATCCGCTCCACGTCGGCGACGCGCAGCTCCACGTTGGTGATGCCGTTGACCTCCATCACCAGCTCCGCGCG 

<E GLETFDIREVDAVRLEVNTIGNVEMVLEAR 

413 5 2 GCGCAGGTTCTCCGGACGGCCCTCCAGGGCAAGCACCGTCGTGCCGGGGTGCCGGGCGAGGGCAAGCGTGTCCGCGCCCTCCAGTGCGCCGA 

<RLNE PRGE LALVTTG PHRALALTDAGELAGL 

414 4 4 GTTCGAGGATCCGCCGCGCGTCGGGAAACGCACCGAAGAACTTCGCGGCCCGGTCGGCCGGGGACTGGCTCAGCAGATAACCGTGCTGGGAG 

< E L I RRAD P FAG FFKAARDA P S QS L LYGHQS 

415 3 6 CCCTCGGCGTAACGCACTCCGTCGTGCTCGAATCCATTCACCCACGGCTCGAGCGCGGCGACGCGGCGACGAATCTCTTCACGGTCCATGG 

<G EAYRVGDHEFGNV 
4162 7 GATCAAAGCCTAGCGATGCCATTGCGGTTGCGGACTAGTGTTTTCATCATATTCAGCGGCTCGCCGTGCTGAGCCTTTCGTTGACCAGCCG 

<o RSATSLRENVLR 
41718 GGCCCATTCCGGAATCCGGTCCCCGCCGATCTCGATATCGAAGGACGACTGGAGTTCCGCCGGCCCGGCGTCGGCCGCCCACCTGGCGGTGC 

<AWE P I RDGG I E I DFS S QL E A PGAD AAWRATR 
41810 GGGCCAGTCCCTCGGCCAGCGGGGTGTCCGTCCAGTCGCCGAAGACCGATCGGGCCAGCTCCGTCGCGGTGTACGCGGTTCGCACCTCGTCC 

<ALGEALPTDTWDGFVSRALETATYATRVED 
41902 CGCGACGGCAGGTGGGCGATCGGATGCTCCGGCACGCCGGCCGCCGACCGTACCGCCTGGGCCAGCTCCAGCACGGTGTTGGTGCTCGACGA 

<R SPLHAI PHEPVGAASRVAQALELVTNTSSS 
41994 GCCCACGTTGAATGCCCGCCCCCACGCGGCCTCGGTCTCGGCGGCGCGGCTGACCACGTTCACCACGTCACCGACGTACGTGAACGCGCGGA 

<GVNFARGWAAETEAARSVVNVVDGVYTFARV 
4 2 0 8 6 CCTGGCCGCCGTCGCCGTACACGGTGATCGGCTCGCCTCGCAGGATCTGGTTGAAGAAGATGGCGACCGCGTTGCGGTACGGGTCCCGCATG 

<QGGDGYVTI PEGRLIQNFF IAVANRYPDRM 
4 2178 TTCTGCCACTCGCCGTAGACGTTGTGCATGCGGAAGGCGGTGAAGGGCAGCCCCTGGGTCCGCATCGTCACCTCCAGCTCGCGCTCGACCAG 

<N QWEGYVNHMRFATFPLGQTRMTVELEREVL 
4 2 2 7 0 GTACTTGGCCAGGCCGTAGCTGTCCGCGGGGACGGGGACGACGGACTCGCGCATCGGCGTCTCGCCGTGGCCGTAGACCGCCACGGAGGAGG 

<YKALGYSDAPVPVVSERMPTEGHGYVAVSSA 
4 2 3 62 CGAAACAGAAGAACCGCACGCCGGTACGCAGCGACGCGTTGATCAGATTTATGCTGCCCATCACATTGGTGCCGTAGTTGAGCTGCTTCACC 

< FCFFRVGTRLSANI LNI SGMVNTGYNLQKV 

4 24 54 GAATGGCTGATCGCCTCCGCCGCGAAGGCGGCAAAGTGGAAGACCCGCTCGAATCGGTTCTCGGCGAACAGTGAATCGACGAAGTCCACGTC 
<S HSIAEAAFAAFHFVREFRNEAFLSDVFD VD 

4 2546 GGTCACCGAACCGACGGCCAGGTCCACCCCGGCCGGAACCCGCTGCCGGCTGCCGCCGCTGAGGTCGTCCAGAACGGTGACCCGGTGCCCAT 
<TVSGVALDVGAPVRQRSGGSLDDLVTVRHGN 

42638 TCCTGACCAATGACTCCACCAGGTGCGAGCCGATGAATCCGGCACCACCAGTCACCAGACAACGAACCATCCGGGGCTCCTTCGTCAATA

< RVLSEVLHSGI FGAGGTVLCRVM 

4272 8 GATCCGGAAAGGGTTGACCGAGCGGGCGATCTGCGGCGCGCCCTTCGTCGGCGAACACCGACCCCCGAGAGAAAGCTTCGCCTCAGGGCAC 

< o p V 

4 2 819 CGGCGACCGGTCGGCCTGCTTCTTCAGCGGCTCCCACCAGTCCCGGTGCGTCCGGTACCAGTCGATCGTCTCGGCCAGGCCGTCGGCGAAGG 

<P S RDA Q KKL P EWWDRHTR YWD I TEALGDAFA 
4 2 911 CGACCTCGGGCCGGTAGCCGAGCGCCCGCAGTTTCGCGTCCGTCAGCGAGTAGCGGCGGTCGTGGCCCTTGCGGTCCGGCACCCGCTCGACC 

<VEPRYGLARLKADTLSYRRDHGKRDPVREV 
4 3 003 CGGTCCCACCCGGCCCCCAAGGCGTCCAGCAGCCGCCCGGTCAGCTCCATGTTGGACAGCTCAGCCGTGCCGGCGATGTGGTAGACCTCGCC 

<R DWGAGLADLLRGTLEMNS LEATGAIHYVEG 
4 3 0 95 GGGGACACCGCGGTCGACGACGGTCTGGATGCCCCGGCAGTGGTCCGTCACGTGGATCCAGTCGCGGACGTTCCCGCCGTCGCCGTACAGCG 

<PVGRDVVTQ IGRCHDTVH I WDRVNGGDGYLP 
43187 GCACCCGTCGCCCGTTCAACAGCTCGGTGACGAACAGCGGGATCAGCTTCTCCGGAAACTGGTACGGCCCGTAGTTGTTGCCGCACCGGGTG

<VRRGNLLETVFLPILKEPFQY PGYNNGCRT 
4 32 79 AGGCAGACCGGCAGCCCGTGGGTGCGGGCGTAGGCCAGGGCGATCAGGTCCCCGCCGGCCTTCGCCGCCGCGTACGGGGAGTTCGGCGCCAG 

<L CVPLGHTRAYALAI LDGGAKAAAYPSNPAL 
4 33 71 GGGGGTGTCCTCGGCCCAGGAACCCTCGTCGATGCTGCCGTAGACCTCGTCGGTGGAGACCTGGACCACCCGGGCGACCCCGGCGTCGAGAC 

<PTDEAWSGEDI SGYVEDTS VQVVRAVGADLC 
434 63 ACGCCTGCATGAGCGTCTGGACGCCCTGCACGTTGGTGCGGACGAACTCCGCCGAGTCGGCGATGGACCGGTCGACGTGCGACTCGGCGGCG 

<AQMLTQVGQVNTRVFEASDAI SRDVHSEAA 
43555 AAGTTGACCACCACGTCGTGCCCGGGCAGCACCTCGGCCAGCAGCGCCGTGTCGCAGACGTCGCCCTGGACGAAGGTGATCCGGTCCTGGAC

<F NVVVDHGPLVEALLATDCVDGQVFTIRDQV 
4 3 64 7 CGGTTCGAGGTTGGCGAGGTTGCCCGCGTACGTCAGCTTGTCCAGCACCGTCACCCGGGCCTGCGCCGTGTCGGGGTAGGCACCGGTGGCCA 

<PELNALNGAYTLKDLVTVRAQATDPYAGTAL 
4 3 7 3 9 GGTCGCGGACGTACTGCGAGCCGATGAAACCGGCACCGCCGGTGACCAGGACGCGACGCATCAGACCCCCACCCGGACTTCGCTGTGAT 

<DRVYQSGI FGAGGTVLVRRM 

<oVGVRVESHD 
43 8 2 8 CGCCGAGGACGAACCGGTGCGTCTTGGGCACCCGGGGCCCGGGGACCACCCGCGCCTCCCGGCCGATCATCGAGAACTCGATGCGGCCGATG 

<GLVFRHTKPVRPGPVVRAERGIMSFEIRGI 

43 920 CCCTCGATGTAGGCACCGCGCAGCACGATGGAGTGCTCGATCTCGGTCTCCAGCAGGGTGCAGTCGCAGTCGATCGAGGTGTACGGGCCGAG 

<G EIYAGRliVISHEIETELLTCDCDISTYPGL 

44 012 GTAGGAGTTGCGGATGATCGAGCCGGCTCCGACCACCACCGGGCCGACGATCCGGGAGCCGCTGACGTCGGCGCCGGCGCTGATCACCACCG 

<YSNRI I SGAGVVVPGVIRSGSVDAGAS IVVP 
44104 GGCCGATCAGTTCGGTGCGGTCGTCGACCTTGCCCTCGACCAAAGGCTCGACGCTGCCGAGGACGAACCGGTTCATCTCCAGCATGTCGGCG 

<G I LETRDDVKGEVIjPEVSG LVFRNMELMDA 
44196 AGGTTGCCGGTGTCCTTCCAGTAGCCCGTGATCATGGTGGAGTCGACCCGGTGGCCGCGGTCGATCATCCACTGCACCGCGTCGGTGATCTC 

<LNGTDKWYGTIMTSDVRHGRDIMWQVADTIE 
44288 CAGCTCGTTGCGCCAGGACGGCTTCAGCTCGGCCACGGCATCGTGGACCACCGGGCTGAAGACGTAGACCCCGACCAGGGCCAGGTCGCTCT
<LENRWS PKLEAVADHVVPS FVYVGVLAL D"* S K 
4 4 3 8 0 TGGGGTGCTCGGGCTTCTCCTCCACCCCGATCACCCGGCCGTCCGCGCCCATCTCGGCGACGCCGAAGGCGTGCGGGTCGGCGACCCGGGTC 

< PHEPKEEVGIVRGDAGMEAVGFAHPDAVRT 

4 4 4 72 AGCATGATCTGCGCGTGCGGTCGCTCCTGCCGGAAGCGCTCGACGATGTCCTTGATCCCGCCGACGATGAAGTTGTCGCCGAGGTACATGAC 
<LMIQAHPREQRFREVIDKI GGVI FNDGLYMV 

4 4 5 6 4 GAAGTCGTCGTCGCCGAGGTAGTCGCGGGAGATGAGCACGGCGTGCGCCAGCCCCCGGGGAGCCTCCTGCGGAAGGTAGGTCACCTGGAGGC 
<FDDDGLYDRSILVAHALGRPAEQPLYTVQLG 

4 4 6 5 6 CGAACTGGGAACCATCGCCGACCACGCGCTGAATTTCGGGCGCGGTGCTGCCGACGACGATGCCCACCTCCTCGATACCGCCCTCACGAATA 

< FQSGDGVVRQIEPATSGVVIGVEEIGGERI 

4 4 748 GCCTCGAGCCCGTAGAACAGCACCGGCTTGTTGGCCACGGGAATGAGTTGTTTGGCGGACGTGTGGGTGATCGGACGCAATCTCGATCCCAC 
<A ELGYFLVPKNAVPILQKASTHTI PRLRSGV 

4 4 84 0 CCCTCCCGCCAGGACCAGCGCCTTCACGAACGCCCTCGAAAGGATGGGACCGAGACGGGTCGCTGGTTCACGAGCACTCCAGGGGTCACGG 
<GGALVLAKV 

4 4 931 TGGACTGGGCTCTTCGTGAACGTACCGAAGGATCACTCGTGATTTCCCTACTTATGGGCCACCGAGGTGTGATCGGTGGATCTCTATGCGT 

< o A D 

4 502 2 CCGCCATTTCCGCAAACGGGGGCCTGGCCGGCCCGCCGCCGACCAGTTCGAGCACGGCGGCCAGGCCGTGCACGGTGTCCGGCGGGCTGGCC 
<AMEAFPPRAPGGGVLELVAALGHVTDPPSA 

4 5114 GGCAGCAGCAGGGTCTGCAGGCCGGCGTACACCGCCCCACCGTCGGCCAGGGTGTCGCCCACCATCAGCGCCCGCTCCGGGGCGACCTTCAG 
<P LLLTQLGAYVAGGDALTD GVMLARE PAVKL 

4 520 6 CTCGTCGCAGGCGGTGCGGAAGATCCGGGGATCGGGCTTGACCGCCCCCACCTCGTGGGAGAAGACGTAGGCGTCCACCAGCTCCGCCATCC 
<EDCATRF I R PDPKVAGVEHS FVYADVLEAMG 

4 52 9 8 CGTACGCCGCGAAGGTGGGCCGCAGGTCCCAGGCGATGTTGCTGACCACGGCGGTGCCCACGCCGCGCCGGCGCAACTCCGCCAGGACCGGG 

< YAAFTPRLDWAINSVVATGVGRRRLEALVP 

45390 GCGGCGTCGCGGTAGGGCAGCCAGCCGTCCGTGCGGAACAGCCGGTCGTAGAGCGCCTCGGCGAGGCCGGGATGGGGCAGGTCGACGGTGGA
<A ADRYPLWGDTRFLRDYLAEALGPH.PLDVTS 

4 54 8 2 GAGCAGGCCCACGTACGCGCTGCGGTGCGTTTCGGGCGAGAGGTCCCGGCGGGCGTACACCTCGGCGAGCCCCGGCGGGACGGCGTGCGGCT 
<LLGVYASRHTEPSLDRRAYVEALG PPVAH P E 

4 5 574 CCGGCCCGCCGGGACGGCCGGCCGCCAGCAGCAACCCGGTCAGGGCCTCCTGCTGCGCCGGGTCGAGCTGGACGCCGACGGTGGCCGCCGCC 

< PGGPRGAALLLGTLAEQQAPDLQVGVTAAA 

4 5 666 GCCCGCAGCCAACGCTGCGGCAGTTCCACGGCGAACAGCGTGCCGGAGAAGTCGAACAGGACGGCGTCGATCGGACGGGGCAGGGGGGTCGT 

<A RLWRQPLEVAFLTGSFDFLVADI PRPLPTT 
45758 CATCGCTCTCCTCGGTGCAGCCGCGTCAGGGCGGCACCAGCCTGGCAGCGGACCAGCCGGATGTCCATCATGGAGGAATGCGCCGGGTCG

< M 

45848 GGCGCGCCCGCCATGGCCGCCGGTCCGGACGACAGGCATTTTCGGTCACTCTTGCCTTCTAGGCGGATTTCTTCAAAGATGGCTGTCAATTC
4 5 940 TTCAGCGATCCTGGAGGCATCCGTGACCCGTACCCGAACCGCCCTGGGCCGGCTGCTCGCCGCCGGTCTGGCCAGCCTCGCCACCGCTGC 

>VTRTRTALRRL LAAGLAS LATAA 
4 6030 CGCGACCCTCGTCGCCACCGCCGGCCCGGCGGCGGCCGCCACCACCCCCGGCATCGACGTGTCCCACTACCAGGGATCGATCAACTGGACGA 

>ATLVATAG PAAAATTPG I DVSHYQGS INWT 
4 6122 GCGTCCGCAACGCGGGCATCCAGTTCGCGTTCATCAAGGCCACCGAGGGTACGAGCTACAAGGACCCCAACTTCAACGCCAACTACGTCAAC 

>SVRNAGIQFAFIKATEGTSYKDPNFNANYVN 
4 6214 TCCTACAACGCCGGAGTGATCCGGGGGGCGTACCACTTCGCCCGGCCGAACATCTCCTCCGGCGCCACCCAGGCCAACTACCTGGCCAGCAA 

>S YNAGVI R GAYH FARPN I S SGATQANYLASN 
4 63 06 CGGCGGCGCCTGGTCGGCGGACAGTCGCACCCTGCCGGCCGCGCTGGACGTCGAGGCCAACCCGTACAGCGGCGGCACGTGCTACGGCCTCA 

>GGAWSADS RTLPAALDVEANPYSGGTCYGL 
4 63 9 8 GCACGTCCGGGATGCGTAGCTGGATCCAGGACTTCCTGAACACGTACAAGGCCCGCACCGGCCGGTACGCGGTCATCTACACCACCACGAGC 

>STSGMRSWIQDFLNTYKARTGRYAVIYTTTS 
4 64 9 0 TGGTGGAACCAGTGCACCGGTAGCTGGACCGGGCCGTGGGCCAACCACCCGCTGTGGCTCGCCCGCTGGTCGAGCACCCCGGGCACCCTGCC 

>WWNQCTGSWTGPWANHPLWLARWSSTPGTLP 
4 6 5 82 GGCCGGCGCTTCGGTCTGGAGCTTCTGGCAGTACACGGCCTCCGGCAGCGTCTCCGGGATCAGCGGCAACGTCGACCGCAACAACTGGAACG 

>AGASVWS FWQYTASGSVSGI SGNVDRNNWN 
46674 GCGACCGCACCCGGCTGATCGCGCTGGCGAACAACACCTGACCCGAACGCCCGGGTAGGCGGTTGGCGGCAGCGGAACCGATTGCGACCGT

>G DRTRLIALANNT® 
4 6 7 6 5 ACGGTCGGCGGCCGGTCCGGCTGCCGCCAGCCCGTCACCCGGCCCGCGTCGCGGGCCGCGCCGGCATCCGGTCCTGCCGGCCCGGGGTGGCC 
4 6 857 CGCCGTGCCATCCGCCAGGCGGCCACGCCTGCGACGGCGCCGGCCACGACACCGAACAGCGCGAACAGCACCGGAAGACCCGCGCTGACCAG 
4 6 94 9 CAGCAGCACCACGTCGCCGAAGGCGACGAGCATCCACAGTGCCAGCCCGGGTCGAGCGTCCCTGCGTCGGTGTCCCATGTCGCACCTCCTCG 
47041 CATCGTCCGGGATCAGATACCCCGTTCGACGCAAGTACATGCGAATCGACATCGCAACGGCGGGAATCGCCTGCTCAGCGGGCCGAGTCGGC
4 7133 AATGGACGGCCCCGGCCGGGCGCTCAGTCGACCCGGTCCGGCTTGAAGCCCTTGGCGATCCGGTCGAAGTCGGCCAGCCGCGCCTGCCAGT 

<o DVRDPKFGKA I RDFDALRAQWD 
4 72 24 CCTTGTTCGCCACCTCCCACCGGAGGGCGTACCCCCGGTTGCTGGCCGTGACGAAGCCCCGGTTGCGGACGTGGATCCGAGTGCCGTCCCGG 

< KNAVEWRLAYGRNSATVFGRNRVHI RTGDR 

47316 TTCTCCAACCACTCCCAGTCCGCGCACGTCTTGTAGTAGTCGCAGCGCTTGATGCTCAGATACTGGTAGCCGTTGACGTAGTTCTTCCGGGC

<N ELWEWDACTKYYDCRKIS LYQYGNVYNKRA 
4 7408 CGGTTCCTTCTCTTTCCAGTCGGCGTAGGCGTCGCCCTCCGGGGTGCTGGTCCACTGCACCAGCAGCTCGCCCACCCCGTCGCGCTCGTCGA 

<PEKEKWDAYADGEPTSTWQVLLEGVGDREDF 
4 7500 AGACGATCGTGTTCTGCCCCACGCTGCGCCGCACCCAGCCCTTCGGCAGGGGCAGCGAGAAGCCCGCCGGGTCCTTGTGCAGGAGCCAACCC 

<VITNQGVS RRVWGKPLPLS FGAPDKHLLWG 
4 75 92 TCGGGCAGGGCGTTCGGGTCCACGGACGGGCTGGCAGACGGCGTCGGGCTGGGCGGAGCCGCGCTGCTGGTCGCCGGTGCGGCCGACGGCGT 

<EPLANPDVSPSASPTPSPPAASSTAPAASPT 
47684 CGGCGTCGACACGCCCGCCTGCGGGTCGCCGCCGGGCCCCGGGTCGTCGTCACCGCTCCGGCCGAGCAGCGGCACCGCCGCCAGCAGGCCGA

<PTSVGAQ PDGGPGPDDDGS RGL.LPVAALLG I 
4 77 76 TCAGCAGCACCGCGACGAGCGCGCCGACCAGCAGGCCGCGGCGGCGACGCTCGGGCTTCGTGCCCCCGACGACGGTGGCCCGGCCCGTCGAC 

< LL.VAVLAGVLLGRRRREPKTGGVVTARGTS 

4 7868 GAGAGTGCCGGGCCGGAGGCGGCCGGCAGCACCGACGTCGGGGCGGCCTCCTCGCGGGGCGCGACGGCGGGCTCGCGGGCCGGGGCGGGCGG 
<S LAPGSAAPLVSTPAAEERPAVAPERAPAPP 

4 7960 CGAAACGGGCACAGCCGCGCCCGGATCGACCCGGGTGTCGTCGGCTCGGGTGTCGTCGGCCCGACCGGCCGCCGGGGCCCCGTCGGCCCCTT 
<SV PVAAG P D VRTDDARTDDARGAAPAGDAG E 

4 8052 CGTCCCCAGCCTCGGCGGCCCGGTCCGCCGGCGGCCCGTCACTCCCTCCGGCCGGCGTGCCCGCAGCCTCGTCGGCGCTGGTCGCGGCGTCG 

< DGAEAARDA PPGDSGGAPTGAAEDASTAAD 
48144 GGGGCGGGGATCTTCGCGGTGGGATCGGCCGCCTGCGGGCCGAAGTCGGTGACCTTCGCGGTCGGATCGGCGTCGGCGCTCGGCAGGTCGAC
<P API KATPDAAQPGFDTVKATPDADASPLDV 

4 8 2 3 6 CATCGCGGTCGGCGCGGCGTCCACCCCCGCCGGGACCTTCGCGGTCGCGTCGCCGCCGCCCACGCCGGCCGCCGCCGCGCCACCGGCGGCAA 
<MATPAADVGAPVKATADGGGVGAAAAGGAAL 

48328 GACCACCTTCCGGGGTACGCGGTGCGGGCGGGGCCGCCGGCCGGCCGGTGCGCTG7TCGGCGGGGCGCGGTGCGGGCACCACCGGCGGTCGC

< GGEPTRPAPPAAPRGTRQEAPR PAPVVP PR 

4 84 2 0 GGCTCGCGCGGGCCGTTCGGCCCCGGCCGGCGTACGCCGTCGAGCAGCGAGATGGTCTTGGCGCGACGCCCGGCGGCCCGCCGCAGGAGCCG 
<P ERPGNPG PRRVGDLLSI TKARRGAARRLLR 

4 8 512 CTCGGCCACCTCGGCGTCGATCCGCTCGGCCGGGTCCTTGCGGAGCAGCCCGTTCAGCACCGGCTTCAACGGGCCGGCGTTCCTCGGCGGCG 
<EAVEADIREAPDKRLLGNLVPKLPGANRPPP 

4 86 0 4 GCATCGGCTCGGTGGCCAGCGCCGCCAGGGTGGCGATCGCCGACGGTCGGGCGAACGGCGACTTGCCCTCCACCGCCGCGTAGAGCGTCGCG 

< MPETALAALTAIASPRAFPSKGEVAAYLTA 

4 86 96 CCCAGCGACCAGAGGTCCGCCTCCGGGCCGGCGGTGCCGTCCCGCGCCCGCTCCGGGGCGATGTACGCCGGGGAGCCCAGCACCATTCCGGT 
<G LSWLDAE PGATGDRARE PA I YAPSGLVMGT 

4 8 78 8 CCGCGTCACGTTCGGGTCACCGGGGATGGTCGCCAGGCCGAAATCGGTCAGCACCACCCGGCCGTCGGTGCCGAGCAGCACGTTGCCCGGCT 
<RTVN PDG P ITALGFDTLVVRGDTGLLVNG PK 

4 88 8 0 TGATGTCCCGGTGCATGACGCCGGCCTTGTGCGCCGCCTTCAGCGCGCCGAGCACCCCCAGGCCGATCTCGACCGCCTTGGCCGGCGACACC 

< IDRHMVGAKHAAKLAGLVGLG I EVAKAPSV 

4 8 9 72 GGCCCGTCCTCCGCGAGAGTGTCCTGAAGGGACTTCGACGCCACGTACTCCATGACGATCCACGGGTCGCCGTCGGTGCGCAGAACGTCGAA 
<P GDEALTDQLSKSAVYEMVIWPDGDTRLVDF 

4 9064 GATGCGGACCACGTTGACGTGGTTGAGTCGCGCGATGGCACGGGCCTCCCGCAGCGAGCGTTCCCGCATCTCGCGGCGCTCCTCCGGGGTGA 
<IRVVNVHNLRAIARAERLtSRERMERREEPTL 

4 915 6 GGCTGGGCGGCGGGACCAGTTCCTTGATCGCCACATCCCGGTGCAGCACCTCGTCGCGCGCCTTCCATACCCGACCCATGCCACCCTGACC 

< S PPPVLEKIAVDRHLVEDRAKWVRGM 

4 92 4 7 GAGCGGCGAAATCAGCCGGTACCGGTCGGCAACGAGTTGGGGAAGCGCGTTCGACATCGGTGGAGACGGTACCCGGCGGCGGCCCGCCGCAC 
4 9339 ACCGCCGGCACGCCACTGTGCGACGAAGGTCAAGTTCGCGACGCGTACGCTGAACGGCATGTCTGCCGAAGAGCCGCTGTTCCGGGTGACCC 
4 9431 GGGGCGTTCCGACCGCCGAGGAACTGGCCGCGCTGGTCGGCGTCATCGTCGCCCGGACCCGCTCCGCCGCCGCTCACGCGCCCGCCGCGCCG 
4 95 2 3 TCGGCCTGGGCACGCAGCGGTCGGCCCGGCGGTACGGTGCTCGCCGCTCGCCCGGGAGCCTGGCGCGCCTCCGGTCTGCCCCGCTGACGCAC 
49615 TCGCCAGCATGTGGCCAGTGTGGCCCGCCGCCGAGGCGGGTGGCGGCGGGGGCTCCCGGAGGTCGACGACACCCGTTAACCTCACTCAGGGA
4 97 0 7 GACGATGCCGTGACGGATAGGGAGGGTCGATGATTCCCGAGGAGGACCGACCAGCACCCTGGTTGAGCGGCTACGGCAGCATCGAGGCCGAC 
4 97 9 9 ATCCGCCAGATGCGCGAGTTCGCCGACAAACTCCAGGCGGAGGTCGAGCGGAACTACGCACCGCATCTGTCGTACATAGCGGACGACATGAA 
4 9891 GGCACAGATCCCCAACCCCGCCGACGCCTTCGTCGAACTGGTGCAGTTCCTCAAGGCGCACAACGAGACCCAGCAGGCCACCGCCACGATGG 
4 998 3 TCTGGTCCGTCGGTGGGGCCACCGGCCACCTGGCCACGGCAGCCGGCCGGATCGCCGAGCAGTACGCCGACTCCGACGCCTTCTCCGCCGCT 
500 7 5 CGCGTCTCCGACGTGGAGCGGGCCCTCGCCAACCCGGGCGCGGCGGGGCAGGGCCGCCCCACGCCGGCTCTGCCCGACCCGACCAGCCCGGA 
50167 CGGCGGGCCGGGGCGGGTGGTGCTGCCATGATCGAGCGGGGCAGCGGCCGGACGTCCGGCCTCACCGACTGGCGGCTGATGGACCTGCTGAG
50259 CATGTGGGCGTGCATCCAGGACCACGAGACCGCCGGGCACTGGAAGCAGGTCGCCGGCTGGCGCAAGGTTTGCGACCTCGCGCAGGCCCACC 
503 51 TGGGCCGCCTCAAGGAGTACCGGCGCGGCCTGGCCGAGGCGTGGCCACCGGCGACCAACGCCGCCGCCCGGGCGTACCTCGGCGAACTGGAC 
5044 3 GACCTGATCGACAAGGTGCAACGCACCCACGACGCGGCTGCCGCGAACTACGACGCCCTCGCGGCCGCCACCCGAGCGATCAGCAGCGCCCG 
50535 CACCGAACTCAAGCCGCTACACGACGAGTACGTGGAGAAACTCCAGCAGAAGCGGGCGTACGAGGCGACTACCGCCGATCCGAAGGCGCTGA
5062 7 TGGGCAGCCGGCTGCCGGACAAGCCGGTGACTGACGCCGACCTGGAACGACTCAACGCCCAGGCCCGGAACCTGATGTACGGGTTGAGCGGC 
50719 GAACTCCAACAGGCCCAAGCAATGCTCCGCCAACCCCCTCCTCCGCACCGCCCAGGACGACAGCCGAACAACCCGGATGCTTACGAGAGCAC 
50811 AGGCACGCCTCCGGTCATCCCGCCCGTATTACCAGTCCCAACCATTAGGCCTTCAATTCCCTCAAGCGCTCAGCAAGCAGAGCCACGCCCTG 
50903 TTCCTGTCCAGAGCGCACCCCAAACAGGTCCTGTACTCGGGGGAATCGGGACGGGAAACACCACTTCGCCATCGGGCTCCATTTCTCCCGGC 
50995 ACCGGCCACCCAGTGACATCCAACCCGGCAGGAGGTGCTGGCCCTACCCTCCCGCCGAACTTGCCACCTGCCATCGCCCCCACTGGACGAGG 

junction marker 

5108 7 CAGCACAACCGGCCAGGTAGGAAACCAGCCCCGCTCAACCACGTCCTCACGACCATTCTTGCCAGGCGGCCTGATCGGCGGCGGGCCCGCGA 
5117 9 TGGGGATTAATCAACCCCTTGGCAATACTCCTCCGCGAAGGATCAACCCAATTGGAGGGGTGATCGGGGGCGGTGGGGCTGGAACCTCACCA 

51271 ACCGGCGCTGCAGGCACTCGCCCGGGCTCGGGACGTGGTCCCCACTTTAACCTGACCCCTGCGGCGCCTCCGAATGGCGCACCGCAAGGCGT

51363 CGTAAACACCTTAGGTCTGAATGGTGAGCCGTCACGCTCATCCCACCGCGAGCAAGGCGATACTGACGGCCGCTACTGGGATCCCGACCATC

514 5 5 CATGGGCAACCAGCGAGGGCGTTCCTCCAGTAGTCCGCCCGCCCGAGGCGGAGGGACCAATCGACCCAGGGCCAGCTATTGGCTTCGACCGG 
5154 7 TGAAAAACGCTCCTCCAGCGGGCGCTACCTTCACCTTCCTAGCAACATTGATGGTTGCATCAGCGTACGCCCCGAGCGCCGTGCGTGCGA 

> V R A 

5163 7 GTCCTGAAACCGATGGTGCCTTGCGAATACGGGCTGACCAGTGGCACCTTAACTACCTGAAGGCGGCGGAGGCTCAGAAGCTGTCACTGGGA 

>SPETDGALRIRADQWHLNYLKAAEAQKLSLG 
51729 GAAGGGGTTGTAGTAGCGGTCCCGGATACTGGCGTTGATCCACACCCCGACCTTCAGCGCAATCTAATCAAAGGGATTGACATCATTCCCGG 

>EGVVVAVPDTGVDPHPDLQRNL I KGIDI I PG 
518 21 GGGCAATGGAGATGGCCAGAAAGATCGCAACAGTCACGGCACTAGCATGGCTGGACTAATCGCAGCCCACGGACAGGGCCAGAGCGGCGCCT 

>GNGDGQKDRNSHGTSMAGLIAAHGQGQSGA 
51913 TAGGCATAGCACCCAGAGCCAAGATCATGCCAATCCTGTCTTCCGCGTCGAACAACCTCGGTGATGCAGACGGCTTGGCTGCGGGTATAGAA 

>L GIAPRAKIMPILSSASNNLGDADGLAAGIE 
52005 TTTGCAATCTCGCATGGGGCGGATGTCATCAATGTCTCCAGCGGAGGCGGCGCCAGCGTTCGACTCATCAAGGCAATCAGAGAGGCGGTCGC 

>FAISHGADVINVSSGGGASVRLIKAIREAVA 
52 0 97 CGCAGACATTGTAGTTGTCGCAGCCGCCGGAAACAGTCCCGAAGACATGACAATTGGCTATCCAGCCAGCGAGGAAGGAGTCGTCGCAGTTG 

>ADIVVVAAAGNS PEDMTI GYPASEEGVVAV 
5218 9 GCGGAATTGATCGACAGGGAGAGCATGCTTCAGTTTCTGTCGTCGGACCGGAAGTTGACTTAGTCGCACCGGCAGTCGACATCTACAGCACC 

>G GIDRQGEHASVSVVGPEVDLVAPAVDIYST 

522 81 AGTTACGACGGGAAGTACTCCAAAGGCACCGGTACGTCCAGTGCCACAGCGATAGTCGCAGGGGCCGCTGCTCTGGTCCGATCGAAGTTTCC 

>SYDG KYS KGTGTSSATAI VAGAAALVRS KF P 

52373 CGACCTGCCCGCCTCGGAGGTCGTCCATCGCCTCACTGCCACCGCCATCGACAAAGGGCCGCCAGGGCACGACGACCAGTACGGCTACGGCG

> DLPASEVVHRLTATAIDKG PPGHDDQYGYG 

524 65 TTATCGACCTGGTTGCCGCGCTTACGGCAGACGTACCCCCGGTGGGCTTTGAGTCGGCGACGGCGGACGTGCCCGACGTGCCTGGGTCGACC 

>VIDLVAALTADVPPVGFESATADVPDVPGST 
52557 ACGACGGCGGTCGCCGAGCCGGCAGGCGAGGGTGACGATGGGGCAACGGCCCGAGGTCTGGCCACGTTGGGAGTGATCGTGGCTGCTGCGGG

> T T A V A E PAG EGDDGATARG LATLGVIVAAAG 
52 649 CGCTTGGGCGCTGGTCGCTCGACGGCGTAGGTTGAGCGACGACCCGCCCCCGCGGATCAGCCGGTGACCAGCCAGCCCTGACGCCATGTCG 

>AWALVARRRRLSDDPPPRI SR» 
52 74 0 GCGACATTGGGGGGTCGGGGTGGGTGGGATACCGCTATTTGGCCGACATGAAGTCGATCAACAGCGATCAACAGGCAGTGTGATCGGCGGGG 
52832 GTCGGTCGACGGCCAGGCGGCGTCGACGGAGGCGGGGGAAGCGGTAGCGTCGGCGACGTGCCGAACTCGATTTCACTCCGACTCGTGCTCGC

>VPNSISLRLVLA 

52 92 4 GTCGGCGAGCCCTGCCCGTCGCAAGCTCCTCCACGCCGCCGGCATCGAACCCGACGTGCTGGTCAGTGGGGTCGACGAGTCCCAGGTGACCA 

> S A S PARRKLLHAAGI EPDVLVSGVDESQVT 

53016 GCGAGCGAGCCGAGGATCTGTGCCTGGAGCTGGCCCGCCTGAAGGCGCAGGCGGTCGTCGGCCGGCTGCGCCCGTCGGCCGACGAGCGGACG

>SERAEDLCLELARLKAQAVVGRLRPSADERT 
53108 CTGGTGCTCGGCTGCGACTCGGTGCTCGCCTTCGACCGCGAGATTCTCGGCAAGCCGGCCGACGAGGCGGACGCTACCCGGCGTTGGGAGCG

>LVLGCDSV LAFDREI LGKPADEADATRRWER 
53200 GATGCGGGGGCGCAGCGGGGTGCTACACACCGGGCACTGCCTGATCGACGTCATCCACGAGACGCGCGCCGAGGCGGTCGCCTCGACCACCG

>MRGRSGVLHTGHCLIDVIHETRAEAVASTT 
53292 TGCGTTTCGCTGACATCAGCGACGAGGAGATTGCCGCGTACGTCGCGACGGGCGAACCGCTCGCGGTCGCCGGCGCGTTCACCATCGACGGA

>V RFADISDEEIAAYVATGE PLAVAGAFTIDG 
53384 ATGGGCGGGGCGTTCCTGGAGGGTGTCGACGGCGACCCGGGCACGGTGGTCGGCCTCTCCCTACCGTTGCTGCGCCGGCTTCTCGGCGAGCT

>MGGAFLEGVDGDPGTVVGLSLPLLRRLLGEL 
53476 GGACCTGCGGATCATCGACCTGTGGACGAAGGTCGCGCCGGGCGGCCAGGCGGTCGAGGCGGTGGGTACGGTCCAGCCATGACGACGAAGT

> M T T K 

> D L R I IDLWTKVAPGGQAV'E AVGTVQP© 

53 5 6 7 CCCTGCCGCTGACCCCGGAACTGCATGCGTACGTGGTGGCCCACGGATCGGACCCGGACGAGGTGATGCGGGATCTGATCGAGGAGACCCTC 

>S LPLTPELHAYVVAHGSDPDEVM RDLI EETL 
53659 GCCGCGCTGCCCGCCGAGGCGAGGATGCAGGTGGCCCCGGAGCAAGCCGCGTTCCTGACGTTCCTCACCCGGTTGATCGGGGCGCGGCGGGC

>AALPAEARMQVAPEQAAFLTFLTRLIGARRA 
53 751 GGTGGAGGTGGGCACCTTCACCGGCCTGTCCTCCCTGGCGATCGCGCGCGGGCTGGCCGAGGGCGGGCGGTTGACCTGCTTCGACATCTCGG 

>VEVGTFTGLSSLAIARGLAEGGRLTCFDIS 
53843 AGGAGTACACGGGCGTCGCGCGGCGGTACTGGGCGCGGGCCGGGGTGGCCGACCAGATCGACCTGCGGATCGGGCCGGCCGGGGACACGCTG 

>E EYTGVARRYWARAGVADQ I DLRI G PAGDTL 

53 93 5 CGCGGGTTGCCGTACGAACGGCACCTGGACTTCGCGTTCATCGACGCGGACAAGGTCGGCTACCCGGTCTACTGGGCGGAGTTGGTGCCCCG 

>RGLPYERHLDFAFIDADKVGYPVYW AELVPR 

54 027 CATGCTCCCGGGCGGGGTCATCGCGGTGGACAACACGTTGCGCGGGGGCCGGGTGCTCGCCCCGCGTGACGCCGACGACCGGGCCATCGCCG 

>MLPGGVIAVDNTLRGGRVLAPRDADDRAIA 
54119 CGTTCAACGACGAGGTGATGGCCGACGTCCGGGTGGAGCCGGTGCTGCTGCCGATCGCCGACGGGCTGACCCTGGCCCGGGTGCGCTGACG

>A FNDE VM AD VRVE PVLLP I ADG LTLARVR © 
54210 GGCGCCAGCCGGACGATCGTGCCAGGTCGCCGGCCCGGCGTCGTGTGTCCGCCTCACCGGGTCGACGAGGGGGTGAGGCGGGGCGGGGTGTT 
54302 AGGAAGGGGCCCTTCCTATACCGAATGCGTTAACAAGGGGCCCCTCCTTACACCTCAACCTCAGCGCACGCTGCGGGCGAACTGGCGGGCG

<» RVSRAFQRA 
54393 GCCCAGGCGACGCCGACGGCGGCAAGCACCGCGATGATGGTCAGGCCCTGCCAGACCTTGTCGTTGCCGAGGTCGCCGGCGAAGAGGGCCCG

<A WAVGVAALVAI ITLG.QWVKDNGLDGAFLAR 
544 8 5 GGTGCCGTCCACGGCCCAGGAGAACGGGTTCCACTCGGCGATGCGCTGGAGCCAGCCGGGGGCGAAGGTGAGCGGCAGCAGGATGCCGGAGA 

<TGDVAWSFPNWEAIR'QLWGPAFTLPLLIGS L 
54 57 7 GCAGCAGCACCGGCTGGGCGACGGTGTTCATCACCGGGGCGAGCGCGTCCTCACTCTTGACCTTGAGCGCGACGCCGTACGAGACGGCCGAG 

< LLVPQAVTNMVPALADES KVKLAVGYSVAS 
54 6 6 9 GTCATCAGCGCGATCAGGGCGAGCATCAGGTACGCCAGCAGCAGGTCGCCGATGAACACGCGCAGCTCGAACAGGAGCGCGAGCAGGGTGAT 

<TMLAILALMLYALLLDGIFVRLEFLLALLTI 
54 761 GATGACGGCCTGGGCGAGCAGCGACACGACGTCGCGCAGGGCCCGGCCGAGCAGCAGCGCGAGCCGGCTGACCGGGGTGACCCGGGACCGTT 

<IVAQALLSVVDRLARGLLLALRSVPTVRSRE 
54 853 CGATGACGCCGGCGCGCAGCTCGGCGATCAGGCCGAAGCCCTGGAAGAGGCCGCCGAAGATGGCCAGCAGCACCAGCAGGCCGGGCACGAAG 

< IVGARLEA ILGFGQFLGG F IALLVLLGPVF 

54 94 5 ATCTTGTACGCCTCGGCCTGGGTCGGCGCGTTCAGCGCGGGCTTGAGCAGCGGGGCGAAGAGGAGCAGGTACATCACCGGCTGGAAGACGCC 

<IKYAEAQTPANLAPKLLPAFLLLYMVPQFVG 
55037 GACGAAGACCCAGACCGGATTGCGGAGCAGGAGTTGCATCTGGCGCTGGGCGACGAGCCAGGTGTCGCGGGCGAACTTCATGATCGGACT

<VFVWVPNRLLLQMQRQAVLWTDRAFKM 
55127 CCGGGTGGTCAGGACTCGCGCAGCGAGCGGCCGGTCTTGGTGAGGAAGACGTCGTCGAGGCTGGGGCGGTGCAGCTCGATCGAGCTGAGCC
<• SERLS RGTKTLFVDDLSPRHLEI SS LR 
55218 TGAGGCCGGACTGGTCGAGCCGGCGCAGGACCTGCGGGATGGCGGTGGCCCCCTCGTCGACGGTCAGGCGCAGGCCGCCGCCGTCGACGGTT 

< LGSQDLRR lvqpiata gedvtlrlgggdvt 

55310 TCCAGCTTGGTGACGTACGGCTCGGTGTCGAGCAGTTGGGCGGCCTGCGGGGTGGCGGCGGCGTCCAGCCCGACGAGCAGCACCTCGCCGGA 
<ELKTVYPETDLLQAAQPTAAAD*'LGVLLVEGS 

554 0 2 GATCTCCCGCTTCAGCCCGCCCGGCGTACCCTCGGCGACCACCTCGCCGTGGTCCATGATCGCGATCCGGTCGCAGAGCGCGTCGGCCTCGT 
<IERKLGGPTGEAVVEGHDMIA IRDCLADAED 

554 94 CCAGGTAGTGCGTGGTGATGAAGACGGTCATCCCCTCGGCGCGCAGCCGACGGATCTCGTCCCACATGTGGGCGCGACTCTGCGGGTCGAGG 

< LYHTTI FVTMGEARLRRI EDWMHARSQPDL 

55 58 6 CCGCTGGTCGGCTCGTCCAGGAAGACAATGCGGGGGTCGTGGATGATGCCGAGAGCGATCTCGACGCGGCGGCGCTGGCCGCCGGAGTAGGT 

<G STPEDLFVIRPDHIIGLAI EVRRRQGGSYT 
5 5676 CTTGCACTTACGGTCGGCGTACTCGGTGAGCTGGAAGGCGGCCAGTGCGCGCTCGGCGCGGCGGAGGGCGTCGGCCTTGCCGATGCCGTACA 

<KCKRDAYETLQFAALAREARRLADAKGIGYM 
55 7 70 TCCGGGCGTGCAGGACCAGTTCCTCGCGGGCGGTGGAGTCGTCCCAGGTGCTGCCGCCCTGGGCGACATAGCCGATCCGGCGACGCACCTCG 

<RAHLVLEERATSDDWTSGGQAVYGIRRRVE 
5586 2 GCCGGGTTCCGCAGCAGGTCGGCCCCGGCGATGGTGGCCTGGCCGCCGTCGGGGGTGATGAGGGTGGCCAGCATCCGCAGGGTGGTGGTCTT 

<A PNRLLDAGAITAQGGDPT I LTALMRLTTTK 
55 954 CCCGGCGCCGTTGGGGCCGAGGAACCCGAAGATCTCCCCCTCGGCGACGTCCAGGTCGACGCCGCGCACGGCGTCGACCGTCTTGTGCTGTC 

<GAGNPGLFGFIEGEAVDLDVGRVADVTKHQR 
56046 GACCGGCGCGGGAGCGAAACGACTTCCGCAGCCCTCTGGTCTGGATCATCTTCGCTCCTGGTCGTCCTTAGCCGGACCGGGCCGGCCCTC

< GARSRFSKRLGRTQIM 

56136 TCTCCGGGACGCCACGCCACACGGGTGGCCCCGAACGTCGCGCCGAGGCTAACGCGATATAACTTCTCTAGTCAACTTTGATTAATGGCGA

<• RS I VERTLKS »HR 
5622 7 CCGTCGGCCCCCTCCCCCACGTTCCAGCCGTCCTGACTGGCCAACCCTTCGGGCAGATACGGCACGCCGGCCTCGATCCGGTCGGCGACCCG 

<G DAGEGVNWGDQSALGEPLY PVGAEIRDAVR 
56319 CTCACACCAGGCCACCTCGACCTCTCCCCGGGCAATCCACAGCTCGTACATCCAGCTCACGCCGACCGGCTTGGAGTCGCGGATCCAGGAGG 

<ECWAVEVEGRAIWLEYMWSVGVPKSDRIWSS 
56411 ACTCCATCGAGGCACGCATGGTTTCGACACTGGCCCGCAGCACCTGCCCCCGACTGCGCAGCGCGGCCACCGCCTCCGGCCTGGGCAGCGCC

<EMSARMTEVSARLVQGRSRLAAVAEPRPLA 
5 6 5 03 GGCAGGAACGCGAACGCCGCCACGAACGGATCGCTCGTCTGATGATTGCCCCACCACAGGCCGCGCAGCAGCGTCTCGAACTCGTCGACCCC 

<P LFAFAAVF PDSTQHNGWWLGRLLTEFEDVG 
56595 CTTCGGGGTGATCTCGTACGTCGTCCGCGCCCGCCGGGCGCCGACCTGCTCGGTGGCGACCTCGCGGAGCAGCCCCTCCTCGCCGAGCTTGC

<KPT I EYTT RARRAGVQETAVE RL LG E EG LKR 
56687 GCAGCGCGTGGTAGATCGAGCCGGGCTGCACGTTGGCCCACTTGTCGGCACCCCAACTGAGCAGCTCGCGGCGGACGTCGTAGCCGTGCACC

<LAHYISGPQVNAWKDAGWSLLERRVDYGHV 
5 67 7 9 GGCTGCATCCACTTGACCAGGCCGAGAATCATCATGCGAGTGGCAGACACCGGAAAAGCGTATTAGACAAGTTTGACTATCCAAGCATCTG 

<P QMWKVLGL IMM 

56 8 7 0 GGCAGTGCCTCATCCCACACTGAGCGATCGTTAGGGCCACGACGCCGGCCGATAAACTCCCCGTCAGTAACATCCCGGGAGGAGCCACGAG 
5 6 9 6 1 GTGCGCAAGGTACTCATCGCCAACCGAGGCGAGATCGCCGTCCGCGTCATCCGCGCCTGCCGCGACGCCGGCCTGGGCAGCGTCGCCGTCT 

>VRKVLIAN RGE IAVRVIRA CRDAGLG SVAV 
5 70 5 2 ACGCGGACTCCGACCGGGACGCCCTGCACGCGACCCTGGCCGACGAGGCGTACGCCCTGGGCGGCGACACCGCCGCCGAGACGTACCTGCGG 

>Y ADSDRDALiHATLADEAYALGGDTAAETYLR 
5 714 4 ATCGACAAGCTGATCGCCGTCGCGGCACAGGCCGGGGCCGACGCCGTCCACCCCGGGTACGGCTTCCTCGCCGAGAACGCCGACTTCGCCCA 

>I D KL I A V A A QAGADAVHPG YG FLAENAD FAQ 
5 7236 GGCCGTCCTCGACGCCGGGCTTACCTGGATCGGCCCGACCCCACAGGCGATCCGCGACCTGGGCGACAAGGTCACCGCCCGGCACATCGCCC 

>AVLDAGLTWIGPTPQAIRDLGDKVTARH IA 

57328 AGCGGGCCGGCGCGCCCCTGGTTCCCGGTACCTCGGACCCGGTCGGCAGCCCGGACGAGGTGATCGCATTCGCGGTCGACCACGGCCTGCCG

>Q RAGAPLVPGTSDPVGSPDEVIAFAVDHGLP 
57420 GTCGCCATCAAGGCCGCCTTCGGCGGCGGCGGGCGCGGCCTCAAGGTGGCCCGCACGATGGAGGAGATCCCGCACCTGTTCGAGTCGGCCAC 

>VA I KAAF G G GGRG LKV.AR T M E E I P~H L F E S AT 
5 7512 CCGGGAGGCGGTCGCGGCGTTCGGCCGGGGCGAGTGTTTCGTCGAGCGGTACCTCGACCAGCCCCGGCACGTCGAGGCCCAGGTCCTCGCCG 

>REAVAAFGRGECFVERYLDQPRHVEAQVLA 
57604 ACCAGCACGGCAACGTGATCGTCGTCGGCACCCGGGACTGCTCGCTGCAACGCCGGCACCAGAAACTCGTCGAGGAGGCGCCCGCGCCGTTC 

>D QHGNVIVVGTRDCSLQRRHQKLVEEAPAPF 
57696 CTCACCGACGCCCAGCGCCGGCAGATCCACGACAGCGCCAAGGCAATCTGCCGGGAGGCCGGCTACCACGGCGCCGGCACCGTGGAGTACCT

>LTDAQRRQ I HDSAKAICREAGYHGAGTVEYL 

57788 GGTGGGCACGGACGGCACGATCTCCTTCCTTGAGGTCAACACCCGCCTGCAGGTCGAGCACCCGGTCACCGAGGAAACCGCCGGCATCGACC

>VGTDGTIS FLEVNTRLQVEH PVTEETAG ID 

578 8 0 TCGTCCGCGAGCAGTTCCGGATCGCCGACGGCGAGAAGCTGCGGCTGGCCGAGGATCCGACCCCGCGCGGGCACTCCATCGAGTTCCGGATC 

>L VREQFRIADGEKLRLAEDPTPRGHS I EFRI 
5 79 72 AACGGCGAGGATCCGGGCCGCAACTTCCTGCCCGCCCCCGGCACCGTCACCGCGCTGCGGCTGCCCACCGGCCCCGGTGTCCGGGTGGACAC 

>NGEDPGRN FLPAPGTVTALRLPTG PGVRVDT 
5 8 0 64 CGGCATCTCCGCCGGCGACGTGATCGGCGGCAACTTCGACTCCCTGCTGGCCAAGGTGATCATCACGGGCGAGACCCGCACCGAGGCCCTGG 

>GI SAGDVI GGNFDSLLAKV I ITGETRTEAL 
5 815 6 AGCGGGCCCGGCGGGCGCTGGACGAGATGGTCGTCGAGGGAATGGCCACGGCGCTGCCGTTCCACCGCCTGGTGGTACGCGACCCCGCGTTC 

>E RARRALDE MVVEGMATAL P F HRLVVRD P A ~ F 
5824 8 ACCGCCGCGCCGTTCACCGTGCACACCCGGTGGATCGAGACGGAGTTCGACAACACCGTCCTGCCGTTCACCGCCGCCGCCGGCCCCGCCGA 

>TAAPFTVHTRWIETEFDNTVLPFTAAAGPAE 
5834 0 GGGCCCGGCCGAGCGGGAGACCGTCGTGGTCGAGGTGGGCGGCAAGCGGCTGGAGGTGACCCTCCCCGCCGGCCTCGGCGCGGGTACGGCCG 

>GPAERETVVVEVGGKRLEVTLPAGLGAGTA 
58432 CCGGGCCCGCCGCGCGGAAGCCGGCCCGGCGGGGCGGCGGGGCCAAGGCCGGCGCGGCGGTCGGCGGCGACGCCCTCACCTCTCCGATGCAG 

>A GPAARKPARRGGGAKAGAAVGGDALTS PMQ 
58524 GGCACGATCGTGAAGATCGCCGTCGCGGACGGGGACACCGTCGCCAAGGGCGACCTGGTCGTCGTGCTGGAGGCGATGAAGATGGAGCAGCC 

>GT IVKIAVADGDTVAKGDLVVVLEAMKMEQ P 
58616 GCTGCACGCGCACAAGGCGGGCACGGTCGGCGGGCTGTCCGCCGAGGTCGGCGCGGTCCTCGCCGCCGGCGCCCCCATCTGCACCATCACCT 

>LHAHKAGTVGGLSAEVGAVLAAGAPI CT IT 

5870 8 GAGGTGCAAGGAGGGGCCCCCTGTTAACGCATTCGGTATAGGAAGGGCCCCTTCCTAACCACGCGCCCGGCGGGGCCGCGCCCCAGCCCGGG 
>© 

5 8 80 0 TACGCGTACCGGCCGGGGGTGTTTTCCGCGACCACCGCGAGCGGTGAGGACCGGGGCCGGGAATGATGGCCAGGTGCGGTTCCTACATGGC 

>>VRFLHG 

5 8 8 91 GCGGTTCCCGCGCACGACCTGACCTACAACGACGTCTTCATGGCGCCGAACCGCTCCGAGGTCGGCTCCCGGTTGGACGTCGACCTGGCCAC 

>AV PAHD L'T Y NDV FMAPNRS EVG S R LDVD LAT 
5 8 983 CTCCGACGGCACGGGCACCACCATCCCGCTGGTGGTGGCGAACATGACGGCGGTGGCCGGCCGGCGGATGGCCGAGACTGTCGCCCGGCGGG 

>SDGTGTTIPLVVANMTAVAGRRMAETVARR 
5 90 7 5 GCGCACTCGCGGTGATCCCGCAGGACATCCCGATCGAGGTGGTGGCCAACGTGGTCGCCTGGGTCAAGCAACGGCACCTGGTGCACGACACG 

>G ALAVIPQD I PI EVVANVVAWVKQRHLVHDT 
5916 7 GCGATCACGCTCGGCCCCACCGACACCGTCGGCGATGCCATCCATCTGCTGCCGAAACGGTCGCATGGCGCGGTGGTGGTGGTCGACGAGGC 

>AITLGPTDTVGDAIHLLPKRSHGAVVVVDEA 
59259 CGGTCGGCCGCTGGGCGTGGTGACGGAGGCGGACACCGTCGGGGTGGACCGCTTCGCCCAGCTCCGCCACGTGATGTCGACCGAGTTGCACA

>GRPLGVVTEADTVGVDRFAQLRHVMSTELH 
5 93 51 CGGTGCCGGCGGACGCGGACCCGCGTACCGGATTCGACCGGCTCTCGGCGGGCCGGCGGCGGCTCGCGCCGGTGGTGGACGGCGACGGCCGG 

>TVPADADPRTGFDRLSAGRRRLAPVVDGDGR 
594 4 3 CTCGTCGGGGTGTTGACCCGCAAGGGCGCGCTGCGCGCGACCCTCTACACCCCGGCGGTGGACGACCGGGGCCGGCTGCGGATCGCGGCGGC 

>LVGVLTRKGALRATLYTPAVDDRGRLRIAAA 
5 953 5 CGTCGGCATCAACGGCGACGTCACCGGCAAGGCCGCCGCGCTGCTGGAGGCCGGGGTCGACGCCCTGGTGGTGGACACCGCGCACGGCCACC 

>VG INGDVTGKAAALLEAGVDALVVDTAHGH 
5 9627 AGGCGCGGATGGTCGCCGCGCTGCGGGCGGTGCGCAAGCTTCACCCGGGCGTTCCGGTCGCGGCCGGCAACGTGGTCACCGCCGATGGGGTA 

>Q ARMVAALRAVRKLHPGVPVAAGNVVTADGV 
59719 CGCGACCTCGTCGAGGCCGGCGCCGACATCGTGAAGGTGGGCGTCGGTCCGGGCGCGATGTGCACCACCCGGATGATGACCGGGGTGGGGCG 

>RDLVEAGADIVKVGVGPGAMCTTRMMTGVGR 
59811 TCCGCAGTTCTCCGCGGTGCTGGACTGCGCGGCGGCGGCGCGGGACCTCGGCCGGCACGTCTGGGCCGACGGCGGGGTACGGCACCCGCGCG 

> PQFSAVL DCAAAARDLGRHVWADGGVRH PR 
5 990 3 ACGTGGCGCTGGCCCTCGCCGCCGGCGCGTCGAACGTGATGATCGGTTCCTGGTTCGCCGGCACGTACGAGTCCCCGGGTGACCTGTACACG 

>D VALALAAGASNVMIGSWFAGTYES PGDLiYT 
5 999 5 GACGCGGACGGCCGGAGGTACAAGGAGAGCTTCGGGATGGCCTCGTCGCGGGCGGTCAGCGCGCGTACGGCCGAGGACAGCGCGTTCGACCG 

>DADGRRYKESFGMASSRAVSARTAEDSAFDR 
60087 GGCCCGCAAGGGGATCTTCGAGGAGGGCATCTCCTCGGCCCGGATGTACCTCGACCCGGATCGCCCGGGCGTCGAGGACCTGATCGACGAGA

>ARKGIFEEGISSARMYLDPDRPGVEDLIDE 
6 017 9 TCATCTCCGGGGTACGCAGCGCGTGCACGTACGCGGGCGCGCGCAGCCTGGCGGAGTTCGCGGAGCGGGCGCTGGTCGGGGTGCAGAGCACG 

>I I SGVRSACTYAGARSLAE FAERALVGVQST 
6 0 2 71 GCCGGCTACACCGAGGGGATGCCCCTACCGACGAGTTGGTGACCCCGCGCCCGGCGGTGAGAAGGGTTCCCCTCTCTACCGGAGGCGTCAA 

>AGYTEGMP L PTSWo 
603 6 2 CAAGGGGCCCTTCCTTCGTGCGCGGCTGGGTATCGGCGTGACCGACTGCCGCACGCGCCGCCGCACACTGAGCCGCCGCCCGTCGAGGGCCC 
6 04 5 4 ACCGAACGGGCGCCGGGGTCAGTCGAAGAGGCGACGGATGACGGTCCGGGCGGCGGCCTCCGGGTCCGGGCCGGTGCCGGGCGGGAGCGCC 

<«> DFLRRIVTRAAAEPDPGTGPPLA 
6 0 54 5 CCGGCCAGCCAGAGTGTCACGAAGCCGTGCACGATCGACCAGGCGGCCAGGGCGTCCGCCTCCTGGTCCGGGTCGGTTTCCCGGCGCGGGAG 

<G ALWLTVFGHVI SWAALADAEQDPDTERRPL 
6 06 3 7 GGCGGCCACCCCGGCCCGCAGCGCCGCGCCGGCCCGATCACGGGCGGCTGTCACCTCGGGGTCGTCGCGACGGTAGAGCTCCGGGCGGAACA 

<AAVGARLAAGARDRAATV E PDDRRYLE PRFM 
6 072 9 TCACCTCGAAGTGGGCCCGGTGGTCGACCGCGAACCGGACGTACGCCACGCCGGCGTCGAGCAGGTCGCCGGCCTCGCACAGCGCCCCGGCC 

<VEFHARHDVAFRVYAVGADLLDGAECLAGA 
6 08 21 AGCAGGTCGAATCCCTCGACGGCGAGCGCGGTGAGCAGCCCCGCCTTGTCGCCGAAGTGGTGCGCGGGGGCGGCGTGCGAGACCCCGGCCCG 

<LLDFGEVALATLLGAKDGF HHAPAAHSVGAR 
60913 GCGGGCCAGGTCGCGCAGGCTCAGGGCGGCCGGCCCGGCGTCGGTGATCGCGTCGACGGCGGCGGCGAGCAGGGCGCGACGCAGGTCACCGT 

<RALDRLSLAAPGADTIADVAAALLARRLDGH 
610 0 5 GATGGTAGCCACGCGGTCCGGTCATGCCGGCAGCCTAACTTGTCATTGACAAGATAGCCAGGCCGAAGCAATCTAGGCAATGACAAGTTG 

<HYGRPGTM 
610 9 5 CCTTCGACCGAGGAGAACCCCGATGGCGCCCCTGATCGCTCTCATCGCCGGCTCGGCCCTGGCCCGACTCGCCGGGCTACTCAACGTCGACG 
6118 7 CCCTGGCCGGCTGGCACCCCGCCCTGCGGGTCGGGCTCGCGGCCATGTTCGCGCTCACCGGGATCGCCCACTTCACCTCCCGACGGCCCGAC 

612 7 9 CTGGTCGCCATGGTGCCGCCCCGACTGCCCCACCCGGGGCTGCTGGTGACCGTCACCGGCCTGCTGGAGTTGGCCGGCGCGGTCGCGCTGCT 

>MVPP R L PHPGLLVTVTG LLELAGAVALL 

613 71 CGTCCCCGGCACGGCGCGGTGGGCAGCGGCCGGGCTGGGGCTGCTGCTGCTCGCGATGTTCCCGGCCAACGCCTCGGCCGCCCGGCGCGGGC 

>VPGTARWAAAGLGLLLLAMFPANASAARRG 

61463 TGACCCTGGCCGGCCGGCCGGTGACCCCGCTCGTCCCCCGCGCGCTGCTCCAGGTGATCTTCCTCACCGCCGCCGCCGCGATTTCGTTTGGG

>L TLAGRPVT PLVPRALLQVI F LTAAAAI S FG 

615 5 5 CCCTGACTATCAGGGAGCTAACATGACCCGCATGGAGCTGATAGGGCTGCGAGACGTGCCACTCGGCCGGCTGCTGGTGACCGCCGGCCA 

>P° >VP LGRLLVTAGH 

6164 5 CGTCGTCGGCCAACGGTGGAACCGCTACCTCGCCGAGGAGCACGGCCTCACCCAGGCGGGCATGGTCACCCTGATGACCCTGGCCCGGCACG 

>VVGQRWNRYLAEEHGLTQAGMVTLMTLARH 
6173 7 GCGAGCTGCCGCACCGGGCGGTCGCCGAGGCGTGCTTCATCCGCCCGGCCACCCTAACCGGCATCGTCGACACACTGGAGCGCGACGGCCTC 

>G ELPHRAVAEACFIRPATLTGIVDTLERDGL 
6182 9 GTCGAGCGGCAACGCGACGACGTCGACCGGCGCAGCGTGCGGCTCGTCCTGACCCCCGCCGGTCGGGAACGGGTCGCCGCGCTCACCAACGT 

>VERQRDDVDRRSVRLVLTPAGRERVAALTNV 
61921 CATGCAGTCCGGACGACCGATGACCTCGGTCGACGCCGACCCGGCGAAGGCCGCCGTGATCCGGCAGTTCCTGCTCGAGGTCATCGGCAGTG 

>MQSGRPMTSVDADPAKAAVIRQFLLEVI GS 
62013 GAGAGGAACCTCGGGTGACGGCCCTCGACGCGAGGCCGGAGGCTCCGGCATGCTGATCCGGCTGCTCCGCGCCCACCTGCGCCCGTACCGTC 

>G EEPRVTALDARPEAPAC o 
62105 GACCGCTGGCGGCGGTGATGGCGTTGCAGTTCGTCGGCACGATGGCCTCGCTCTACCTGCCGAGCCTCAACGCCGACATCATCGACCAGGG 

> M A L Q FVGTMAS LYLP S LNAD I IDQG 
62196 TGTGGCCCGGGGCGACACCGGCTACATCATGCGTACGGGCGGCTGGATGCTGCTGGTCAGCCTGGTGCAGATCGCCTGCTCCACCGCCGCGG

>VARGDTGY IMRTGGWMLLVS LVQIACSTAA 

622 8 8 TCTTCCTCGGCGCGCGCTCCGCGATGGGCTTCGGCCGGGACGTACGCGCCGAGGTCTTCGCCCACGTCAACCGGTTCTCCGCCCGCGAGGTG 

>V FLGARSAMGFGRDVRAEVFAHVNRFSAREV 

623 8 0 ACCCGCTTCGGCGCACCCTCGCTGATCACCCGCAACACCAACGACGTGCAACAGGTGCAGATGCTCGTCCTGATGAGCTGCACCATGCTGGT 

> T R FGAPS L I TRNTNDVQQVQMLVLMSCTMLV 

62472 CGCCGCGCCGATCATGAGCGTCGGCGGGGTGTTCATGGCACTGCGGGAGGACGTCGGGCTGTCCTGGCTGATGCTGGTCAGCGTGCCGGCGC

>AAPIMSVGGVFMALREDVGLSWLMLVSVPA 

62564 TGGCGATCGCCCTGATGCTGATCATCCGGCGGATGGTGCCCGGGTTCCGGCTGATGCAGACCCGCATCGACGCGGTCAACCGCGTGCTGCGC

>LAIALMLI I RRMVPGFRLMQTRIDAVNRVLR 
62 6 5 6 GAGCAGATCACCGGCATCCGGGTGGTCCGGGCGTTCGTCCGCGAGCCGTACGAGACGGCGCGCTTCGGCCGCGCGAACGCCGACCTCACCGC 

>EQ ITGI RVVRAFVRE PYE TARF,GRANADLTA 
62748 GACCGCCCTGCGCACCGGTCGGTTGATGGCCCTGATCTTCCCCGTGGTGACGCTGGTGCTCAACGTCTCCAGCGTCGCCGTGCTGTGGTTCG 

> TALRTGRLMALI FPVVTLVLNVSSVAVLWF 
62840 GCGCGGACCGCGTCGACGCCGGCCAGATCCAGGTCGGCGCGCTCACCGCCTTCCTGCAGTACCTCATGCAGATCCTGATGGCCGTCATGTTG

>GADRVDAGQ IQVGALTAFLQYLMQI LMAVML 

62 9 3 2 GCCACCTTCATCCTGATGATGGTCCCGCGCGCCGCGGTCTGCGCCGAGCGGATCGTCGAGGTGCTCGACACCGACTCGACGGTGATCCCGCC 

>AT F I LMMV PRAAVCAERI VEVLDTDSTVI PP 
63024 GGCCGCGCCGACGGCCGAGGTGACCGGCCGGGGCGAACTGGAACTGCGCGGCGTCCGGTTCCAGTACCCGGGGGCGAGCGCGCCGGTGCTGC 

>AAPTAEVTGRGELELRGVRFQYPGASAPVL 
63116 ACGACATCTCGTTCCGGGCCACGCCCGGGCGCACCACGGCCATCATCGGCAGCACGGGGGCCGGCAAGACGACCCTGCTGACGCTGATCCCC 

>H DISFRATPGRTTAI IGSTGAGKTTLLTLI P 
632 0 8 CGGCTGATCGACGCCACCGCCGGGGCGGTGCTGGTCGACGGGGTGGACGTGCGTGACCTGGCCCCGGACGATTTGTGGCGGCGGATCGGGCT 

>RL IDATAGAVLVDGVDVRDLAPDDLWRRI GL 

63 3 0 0 GGTGCCGCAGCGGCCGTACCTGTTCAGCGGCACGATCGCCAGCAACCTGCGGTACGGCAACCCGGACGCCACCGACGCGGAGCTGTGGGCCG 

>VPQRPYLFSGTIASNLRYGNPDATDAELWA 
63 3 92 CCCTGGAGATCGCCCAGGCGCGCGACTTCGTCGCCGAGTTGCCCGAAGGGCTGAACGCCCCGATCACGCAGGGCGGCACCAATATCTCCGGC 

>A LEIAQARDFVAELPEGLNAPITQGGTNISG 
634 84 GGGCAGCGCCAGCGCCTCGCGATCGCCCGGGCCCTGGTCCGCAAGCCGGAGATCTACCTGTTCGACGACTCGTTCTCGGCGCTCGACCTGGG 

>GQR QRLAIARALVRKPEIYLFDDSFSALDLG 
63576 CACCGACGCCCGGCTGCGCGCGGCGCTACGACCGGTCACCGCGGACGCGACGGTGCTGATCGTGGCCCAGCGGGTCTCCACGATCGTCGACG 

> TDARLRAALRPVTADATVLI VAQRVSTIVD 
63 66 8 CCGACCAGATCATCGTGCTTGAGGACGGGGGCATCGTCGGGATGGGCCGACACGCCGAACTACTGGAAGACTGCCCGACGTACGCGGAGATC 

>A DQI IVLEDGGIVGMGRHAELLEDCPTYAEI 
6376 0 GTCGCCTCCCAGCAGACGGCGGGGGTGCCGGCATGACGGCCGTACCGGATCAGCGGCCCACCGCGCGGCCGGCGGAGGGGCCGACGCCGAA 
>VASQQTAGV PA • 

Figure 11Q



63 8 51 GCGGCTGCCCTCCGGCAACCAGGGCAGCGGCCCGAGGTGGATGAGCGCCGGCATGCCGGCCGAGAAGTCGATGAACTTCGGGCCGTCCAC 

> M S A G M PA EKS MNFG P ST 
6 3 941 CCGCCGGCTGCTGCGCCGGCTGCGACCGCACCGCCTCCAGCTGGCCGCCATCGTCCTGCTCTCGCTGGTCAGCGTCGGTTGCAACGTGTACG 
>RRLLRRLRPHRLQLAAIVLLSLVSVGCNVY 

64033 GGCCGAAGGTGCTCGGCCACGCCACCGACCTGATCTTCAGCGGGGTGATCGGCCGGCAGTTGCCGGCCGGCACCACCGCCGAGCAGGCGGTC

>G PKVLGHATDLI FSGVIGRQLPAGTTAEQAV 
6412 5 GCGGCGGCCCGCGCGGCCGGTAACGACAGCTTCGCCGACATGCTGGCCCGGATGGACGTGGTGCCCGGGGTGGGCATCGACTTCACCGCCCT 

>AAARAAGNDSFADMLARMDVVPGVGIDFTAL 
64217 GGGCCGGGTGCTGCTGTTCGTGCTCGCCCTCTACCTGGCGGCCAGCGTGCTGTTGTGGTGGCAGGGCTGGCTGCTCAACGGGGTGGTGCAGC 

>GRVLLFVLALYLAASVLLWWQGWLLNGVVQ 
64 3 0 9 GCACGGTGCTGCGGCTGCGCGCCGACGTGGAGGACAAGCTGAACCGGCTGCCGCTGCCGTACTTCGACCGGCAGCCCCGGGGCGAGTTGCTC 

>R TVLRLRADVEDKLNRLPL PYFDRQPRGELL 
644 01 AGCCGGGTCACCAACGACATCGACAACATCTCGCAGAGCCTCCAGCAGACGCTGAGCCAGCTGCTCACCTCGCTGCTCACCGTGGTCGGCGT 

> S R V T N D IDNI SQSLQQTLSQLLTS LLTVVGV 
64 4 93 ACTGGCCATGATGTTCTGGATCTCGCCGCTGTTGGCGCTGGTGTCCCTGGTCGCGGTGCCGATGTCGGTGGTGGTCACCAGCCTGGTCGCGA 

>LAMMFWIS PLLALVSLVAVPMSVVVTSLVA 
64 5 8 5 AGCGGTCACAGCAGCGGTTCATCGCCCAGTGGACGCATACCGGAGAGCTGAACGGCCAGATCGAGGAGGCGTTCACCGGACACGAGCTGGTC 

>KRSQQRFIAQWTHTGELNGQIEE .AFTGHELV 
64 6 7 7 AAGGTCTTCGGCCGGCAGCGCGAGGTGGAGGCCGCCTTCACCGCCAAGAACGAGGAGCTGTTCCGGGCCAGCTTCGGCGCCCAGTTCATCTC 

>KVFG RQREVEAAFTAKNEE LFRAS FGAQF IS 
64769 CGGGATCATCATGCCGGCGATGATGTTCATCGGGAACCTCAGCTACGTCGCGATCGCCGTGGTCGGCGGGCTGCGGGTGGCGTCGGGGTCGA

>GIIMPAMMFIGNLSYVAIAVVGGLRVASGS 
64 861 TGAGCATCGGCGACGTGCAGGCATTCATCCAGTACTCCCTCCAGTTCACCCAGCCGCTGACCCGGGTCGCCTCGATGGCCAACCTGCTCCAG 

>MSIGDVQAFIQYSLQFTQPLTRVASMANLLQ 
64 95 3 TCCGGGGTGGCCTCCGCCGAGCGGGTGTTCGCGGTGCTCGACGCCGAGGAGCAGAGCCCGGACCCGGCGGTGCCGGCCCGGGTCGCCGACCA 

>S GVASA ERVFAVLDAEEQS PD PAVP.ARVADQ 
6 504 5 GCGCGGTCGCGTCGAATTCGACCACGTCTCATTCCGGTACGAGCCGGACAAGCCGCTGATCACCGACCTGTCGCTGGTCGCCGAGCCGGGGC 

>RGRVEFDHVSFRYEPDKPLITDLSLVAEPG 
6 513 7 ACACGGTTGCCATCGTCGGGCCGACCGGCGCCGGCAAGACCACCCTGGTCAACCTGGTGATGCGCTTCTACGAGCTGGACGCCGGCCGGATC 

>H TVAIVG PTGAGKTTLVNLVMRFYELDAGRI 
6 52 2 9 ACCCTCGACGGGGTCGACATCACCACGCTGAGCCGCGACGACCTGCGCGGCCGGATCGGCATGGTGCTCCAGGACACCTGGCTCTTCGGTGG 

> T L D G V D ITTLSRDDLRGRI GMVLQDTWLFGG 
653 21 CACGATCCGCGACAACATCGCGTACGGCCGGCCGGACGCGAGCGAGGAGGAGATCGTCGCCGCCGCCCGGGCGACGTTCGTGGACCGGTTCG 

>TIRDNIAYGRPDASEEEIVAAARATFVDRF 
6 5413 TGCGTAGCCTCCCCGACGGCTACGACACCGTCATCGACTCCGAGGGCAGCAACGTCAGCGCCGGCGAGAAGCAGCTCATCACCATCGCCCGG 

>V RS L PDGYDTV IDSEG SNVSAG EKQL I T I AR 
65505 GCGTTCCTGGCCGAGCCGTCGCTGCTGATCCTCGACGAGGCGACCAGTTCGGTGGACACCCGCACGGAGGTGCTGCTCCAACGGGCCATGGC 

> A F L A E PSLLI LDEATSSVDTRTEVLLQRAMA 
655 97 GGCGCTGCGCTCGGACCGGACCAGCTTCGTCATCGCCCACCGTTTGTCCACCATCCGCGACGCGGACCTGATCCTGATGATGGAGCACGGTC 

>ALRSDRTS FVIAHRLSTI RDADLI LMMEHG 
6 56 8 9 GCATCGTCGAGCAGGGCACCCACGAGCAGCTCCTGGCCGCCCGGGGCGCGTACCACCGGCTTTACCAGGCCCAGTTCACCCAGCCGGACCCG 

>RIVEQGTHEQLLAARGAYHRLYQAQFTQPDP 
6 5781 GCCGCCGTCGGGGACCCCGAGCCCCAGCCCGCCTCGGTCCGGGGCTGACCGTCGTGGCCCGGTACATCCCGCCCAGCTCCCCGACCCGCAG 

> A A V G D PE PQ PASVRG © 

658 72 GGGCAGCTCCCGGGGCCCCGGGCGCGGAAGACCAGAGCGTGGTGCGGGCAGAGCGCCGGGCGGAGCACGAACTCGTCGTCGGGGCCCAGCCG 
65964 CATCGGCGGGAACATGTCGTCGGCGAAGTGGCCGAGGTCGGGGCCTCGATGACGGTGACGTCAGTGCGGCGCGCCGGGGTTCCCCGACGTCG 
660 56 TGGTCCGTACCGCGCTGCGCATGGGGAGACGGTAGGCCGGCCGGGGGCGGGCGCGCATCCGAATTCGGGGCCGGCCCGACGCAGCCGAGACG 
6614 8 CATCCGGTTACCGCGAAGGGCGACAGGCCGCCAGTCGCTACAGTCGGCAACCCGGCCGATGCACAGCGCGCCTCGGGGCGGGCCGCCGGCAC 
6624 0 GGGACCTGCCGGCGGCCCATGGCGGCCCGGTCGTCGGCTACGGGGGGTCCGACACTCGAGGCCGCGCCGACTACGGTACGCCGGCCGTCGGC 
6633 2 GATCCGCCGACCGCCTCCACCCGTACGCCGAGGTTGCCGGCCCCAGCGGGTCCCCGGCCCGCCGCGCTGCCCGACCGCCGCGTCGCCGGCCC 
6 64 2 4 GACGACCCGGTTCGTGGCGGGTGCCGGCGAGGTGGAGCCGCCCAGTTACTCCAGCTCGTGGAGCATGAGCTGGCGGGCGGCCTCGGTGATC 

<» ELEHLMLQRAAET I 
66515 GAGCCCGACAGGCTCGGGTAGATGGTGATGGTCTGGGCCAACTCGTTGACCGTGAGGTTGTTCTCCACCGCCATGGTGATCGGCAGGATCAG 

<S GSLS PY IT ITQALENVTLNNEVAMTI PLIL 
66607 CTCGCTGGCCTTCGGTGCCACCACCACACCGCCGATCACCTGGCCGCTGGCCGGGCGGCAGAACAGCTTCACGAAGCCGTCGGCGAGGTCGT 

<ESAKPAVVVGGIVQGSAPRCFLKVFGDALDD 
666 99 CCATCTTCGCCCGGGCGTTGCCCGACAGCGGCAGCATCACCTGGCGGGCCGGGGTCTTGCCGGCGTCCACCTCGTCCTGGGAGACGCCGACG 

< MKARANGS L PLMVQRAPT KGADVE DQSVGV 

6 6791 GTGGCCAACTCCGGGTCGGTGAAGACGTTCGCGGCCACCGTACGCAGCCGCAGCGGCCGGACCGCCTCGCCGAGCGCGTGCCACATCGCGAT 

<TALEPDTFVNAAVTRLRLPRVAEGLAHWMAI 
66883 CCGGCCCTGCATGGCGGCGACGCTGGCCAGCGGCAACACCCCGGTGCAGTCGCCGGCAGCGTAGATCCCGGGGACGTTGGTGCGGGACACCC

<RGQMAAVSALPLVGTCDGAAYI G PVNTRS VR 
6 6 975 GGTCGACGGTGACGTAGCCGCCCCGGGCCAGCTCGACGCCGTACTCGGCGAGGCCGAGGTTGGCGGTGTTGGGGATCGAGCCGACCGCGATG 

<DVTVYGGRALEVGYEALGLNATNPISGVAI 
6 706 7 AGCGCGTGCGAGCCGTGCACCAGCCGGCCGTCGGCCAGTTCGACCTCCACCCCGTCGGCGATGCGCTGGACCCTCTCGGCGCGGGAGTTGTT 

<L AHSGHVLRGDALEVEVGDAI RQVREARSNN 
6715 9 GAGGATCGTCATGCCCCGGGAGCGGAACACGCGCTCGATCGCCATGGCGGCGTCGGCGTCCTCGTGCGGCATCACCCGGTCCCGGCTGGAGA 

<LI TMG RSR FVREIAMAADAD EHPMVRDRS SV 
6 7251 CGAGGGTGACCGGGACCCCCATGGCCAGGTACGCGCTGGCGAACTCGGCACCGGTGACGCCGGAACCGACGACGATCAGGTGCTCGGGCAGG 

< LTVPVGMALYASAFEAGTVGSGVVILHEPL 

6 7343 TGCGGCAGGTCGTACACCTGCCGCCAGGTCAGGATGCGCTCGCCGTCCGGCACGGCCGTGGGGAGCTGGCGGGGGGTGGCGCCGGTGGCGAC 
<H PLDYVQRWTLIREGDPVATPLQRPTAGTAV 

6 74 3 5 CAGCACGGTCGACGCGTCGATCGAGTGCTTCTCGG AGCCGTCGGCCGGCGTGACGACGACGCGGTGGGTGTGGCCCAGCATGTCCTCGCCGA 
<LVTSAD I S H KESGDAPTVVVRHTHGLMDEGL 

67527 GCCGGGCCGTGCCGGCCACGAAGGTGACGCCGGCTTTCACCAGCTTCGCGTGGATGTCGGCGGACTGGGCCAGGGCGAGCCGCTTGACCCGC

< RATGAVFTVGAKVLKAH I DASQALALRKVR 
67619 TCGTGCACGGCCCGGGCGTCGACGGTGACCGCCTCCAGCCCGTCGGAGTGCACCCCGAACTCCTCGGTGTCCCGGTACCCGGTGACCACCTC 

<E HVARADVTVAELGDSHVGFEETDRYGTVVE 

Figure 11R
67711 CGAGCTGGCGATGAACGTTTTCGACGGTACGCAGTCGGACAGCACGCAGGCACCGCCGGCCCCCTCGGCCTCCACCACGGTGACATCAGCGT
<SSAI FTKS PVCDSLVCAGGAGEAEVVTVDAD 
6 7803 CCAACTGGGCGGCGACCAGGGCCGCCTCGTACCCGGCCGGCCCCCCGCCGATGATCACGATCTGGCTCACAGCTATCGCCCCTCGTCCGT 

< LQAAVLAAEYGAPGGG I IVIQSV 

6 78 93 GCTCACAGTGACTTTCTTCTCCCCGACGCGTCCGACACGCACCGTCGTATTCTCCCCCAGCCGTCCGCCGGGCTATCGTCATCGCCGTGCG 

> V R 

67984 TCACTACGCCGCCTACGGCTCAAACCTGGACCCCGCCCGGATGCGCGCCTACTGCCCGCACTCCCCGATGGTCGGCGTCGGCTGGCTGGAGG 

>HYAAYGSNLD PARMRAYC PHS PMVGVGWLE 
68 0 76 GCTGGCGGCTCACCTTCGCGGGTGAGGGCGCGATCGGCTGGGAGGGCGCGGTCAGCACCATCGTCGAGTCCCCCGGTGATCGGGTGTTCGTG 

>G WRLTFAGEGAIGWEGAVSTIVESPGDRVFV 
6 816 8 GCGCTCTACGACATCCACCCGTACGACGCCGTCCAGCTCGACGAGATCGAGGGGGTGGCCTCCGGGACGTACCGCAAGCTGCACGTCCGCGT 

>ALYDIHPYDAVQLDEIEGVASGTYRKLHVRV 
6 82 6 0 CTCCACCCTCGACGGCGACGTGACCGCGTGGGTCTACGTCTTCGACGGGTACGAGGGCGGCCTGCCGACGGCGTGGTATCTGTCGGAGATCG 

> STLDGDVTAWVYVFDGYEGGLPTAWYLS E I 
6 83 52 CCAACGCCGCCGAGAAGGCGGGCGCGCCCGACGACTACGTCAGCGAGCTGCGGTCCCGCCCCACCGGCACGGCGTCGGCGTAGCGCGTCTC 

>A NAAEKAGAPDDYVSELRS RPTGTASA* 
684 4 3 CCACACTCCCAGTCTGCTCCGCCCGAGACGGGGCCGCACGGCGGCCCCCGCCGGGGGTCGTCTGTCACACATCATGGTCGCGCCCGTCACA 

< o V 

6 8 5 34 CCGCCGTGGCGGCGGGGACGGTGCGCTCGTACATGTCGGTCCAGCGCATCTCGCGCAGCCCCACCGAGCGGTAGAGCGTCGCCGGGGAGGTC 
<ATAAPVTREYMDTWRMERLGVSRYLTAPST 

6 8 626 GGGTTGGTCAGGTCGACGCCGAGGCCGGCGTGCCGCCGTCCCTTCGCCGCGTAGACCGTGAAGGCCCGCCACAGCAGCGCGGCGCCGACCCC 
<P NTLDVGLGAHRRGKAAYVT FARW LLAAG VG 

68718 GTGCCGCCGGTACTTCGGCAGCACCGACAGGGTCCGCACCCAGCCCGAGTCCTGTTCCAGCGCCTGGTCGGACGACTGCAACGCGCCGGCCG 
<HRRYK PLVS LTRVWGSDQE L A Q D S S QLAG AP 

68 810 GCTCCCCGTCGACCTCGGCGACGAACCACTCGTCCCAGGTCCTGTCGTACGCGGGCAGACGCTCCCGCCAGTGGTCGTACCCGGCCGGCTCG 

< EGDVEAVFWEDWTRDYAPLRERWHDYGAPE 

6 8 902 TAGTCCGGGGTGTCCCGGAACGCCGTGTCGTAGATCCGGTGGAACAGGCGCAGGTCGTCCTCGTCGCCGGCGCGCAGCGGCCGGACCGTCAC 
<YDPTDRFATDYIRHFLR LDDEDGARLPRVTV 

68 9 94 CCCGGGTGGGGGCGGCGGCTCGGCGGGCAGCCCGGCCAGGTCACG ACTCATCCGTACGTACCGCTTCACCCGGCTGAAACCGGCCTCGGTCA 

<GPP P P PEAP LGALDRSMRVYRKVRS FGAE TL 
6 90 86 GCTCCGTCACCCAGCGGGTCTCCGGCGGGTAGGCCGAGGCGCGGACGGTCAGCGCGGG CAGGCTCCGCTCCGCGGCCCGCTCGGCGACCCGG 

< ETVWRTE P PYASARVTLAPLS REAAREAVR 
69178 TCCAGCATCAGAGCGAGCAGCGGGGCGCGTACCGCCTCGGCGCGCTCCGGGTCGACGAGGACGTCGACGAACTCCCGGCCCACCCCGGTCGG 

<D LMLALLPARVAEAREPDVLVDVFERGVGTP 
6 9270 GTTGTCCACCACCGACCAGGCGACGAGCCGGCCCTGCGGGTCGCTGACCAGCCAGGAGTCGCGCGCCGGGTCGAAGAAGGGGGCCGTCAGGG 

<NDVVSWAVLRGQ PDSVLWS DRAPDF FPAT LA 
6 9362 CGGCCTTGACGTCTTCGGCGTCGAAGTCCGGGTGGCCGATCGCGAAGGTGTCTGCGGCGTGCACGACGGCGAGGATCCCAGGGACGTCGTCG 

< AKVDEAD -FD PHG IAFTDAAHVVAL IGPVD.D 
694 54 AGGGTGGGGCGGCGCGCCGCCCAGTCAGCGGGAAGAGTCACGGGGCGATCCTGGCAGCCACCCCGGTCCCGCGCCCCTCATTTTCAACCGC 

<L TPRRAAWDAPLTV 
6954 5 CCCGCGCCCTGCCCCCGCCGCCGCACGCGCTCGCCGGCGAGAGGGGACCCTTTCTACCCCAGGCGTTAGTAAGGGGCCCTTCCTTGCACCAC 

<oYPARGQVV 
6 963 7 GGCGCGTGCGGTGGTCAGCAGGTCGAGCAGCGCGGTGCCCTCGGCCGGCGTGAGCGCCCGGAATCCGGTCGACACCAGCCGGTCGGTCACCG 

<ARATTLLDLLATGEAPTLARFGTSVLR-DTVA 
6 97 2 9 CCTCCGACCACAGCCGGCGGCGCACCAGCGGCCCGACCCGGGGGTACGGCGGCGGCCAGCCGCAGGCCAGCGCCCCGGCCTCCCCCTCCGGC 

< ESWLRRRVLPGVRPYPPPWGCALAGAEGE P 

6 9821 CCGGCCAGCACCGCCTCCAGCGGCGTCATCCCGCCGGCCCGCACCGCCAGCAGGTACGCCCCGGCGAAGTGCTCCCGAAGCAGCAGCAGCCC 
<G ALVAELPTMGGARVALLYAGAFHERLLL L G 

69 913 GGCGGCGGCACGGGCCCCCGGGGTGTCGTCCGGCGGCGGCACGGCCCGCCAGGCGGCAAAGAGTGGCATCCCGCTGGCGTCCGCGGCGTCGA 

<AAARAGPTDDPPPVARWAAFLPMGSADAADV 
7000 5 CCACCCGGTGCAGCAGCGTCGCCAGACGGATCACGCCGGGCACGGCGGTCAGGTGCTCGCTGCCCCACCGGCAGCACTCGGCCAGGTTCGCC 

<VRHLLTALRIVGPVATLHESGWRCCEALNA 
70097 GTGGCCACCTCCAGCGGAGCGTGCACGCGGGCGGCGGCGTCCCACCCGTCGGCGACCGCGTCGGGGGCGACGAAGCCGAGGGCCGCGCTCAC

<T A V E L PAH VRAAADWGDAVAD PAVFGLAA SV 
7018 9 CGTGGCGGCCCCGACGTCGCCGAGCACCCCGGCCCGGCCGGCGATGTGGAAGGCCCAGCCGGAGATGCCCAGCAGACGGGCCCGGTGCAGGG 

<TAAGVDGLVGARGAI HFAWGS I GLLRARH LT 

702 81 TGGCCGGGCAGCGGGCGAACATCTCCCCGAGCTCCAGCACCAACGGCTTGCTCGCGGCGGCGACCTGCTCCGGCGTCATCGGGTTCAGTCT 

<APCRAFMEGLELVLPKSAAA VQE PTM <o D 

703 72 GCCCCGCCGGGGCCTTCCGTGACAGCCCCGTCTTCGGCGTCGACCGCCTCGATCGCCCCCTCGATTTCGCCGGTGCGGCGCTGGGCGGCGGT 

<A GGPGETVAGDEADVAEIAGEIEGTRRQAAT 

704 64 GACGGCCCGTTCCGCCGCCCGCCGGGCCAGCTTCCGCCGGCTGAGTTCCTGCTCGGCCTCTGCGCGCGATCGCTCCAGCTCGGCCAGCGACC 

<VAREAARRALKRRSLEQEAEARS RE LEAL SR 

70 5 5 6 GCTCGATGCCGGTCAGTTCCTCGGCACCGTCGTGCTCGGCCTCCACCGCGCCGGCCAGCTCCGCCTCGGCCCGCTCCTGGTCGGTACGCGCC 

< EIGTLEEAGDHEAEVAGALEAEAREQDTRA 
7064 8 CGCGCCAGTTCCCGTTCCAGCATCCGGCGTTGCCGGGCCCGCTCGGCACGGGCGGCGCGCTCGGCCTCCCGGGCGGCCCGCTTCGCGCGCGG 

<R ALERELMRRQRAREARAAREAERAARKARP 
7074 0 TGGTGGGTGGGTGGGCGGTGGCTGCTCCTCGCCGCCGGTGACCAAACGGAGCTGGGGGCGGGGCACCTCGCCGAAGCCGGCGTAGCTGGCGG 

<PPHTP PPQE EGGTVLRLQPRPVEGFGAYSAA 
70832 CCCGCAGGAGCCGGCCGGAGCGCACCTGCCCGGCCACCTCCGTGTCGGAGAGCGCGGCGTCGAGCGTCGCCTCCACCTCGCCCAGCGGCAGC 

< RLLRGSRVQGAVETDSLAADLTAEVEGLPL 
70924 TTCCCGGCCGGCGGGCCGCCCTCGGCGTCGGCGGCCAGCCGGCGCGCCTCGGCGACCAGCGCGGCGACCGCCGCCCGCCGCTGGGCGGAGAG 

<K GAPPGGEADAALRRAEAVLAAVAARRQASL 
71016 TTCCCGCAGCCGGGGGCCGCGCAGGTCGCGCTGGGCGCGGCGCAGCGCCTCGGCGAGTTGGGTCAGGTCGGCGACCAGCTCGGGGCGGCGCA 

<ERLRPGRLDRQARRLAEALQTLDAVLEPRRL 
71108 GGGCGAGCAGGTTGACCAGCCACGCGGCCACGGTCGGGCGGCGCAGCCGGGCGATCTCGCGGGCCGTGGCCGGGTCGCCGGAGCGCCGGGCC

< ALLNVLWAAVTPRRLRAI E RATAPDGSRRA 
71200 TCGGCGACGGCGGCGTCGCGGGTGGCGACGAACTTCTCCGGCGGCTCGGTGTAGAGCCGCCGGAGGAGGCTCTGCGGGGGCGGCACGGCTC 

<E AVAADRTAVFKE PPETYLRRLLSQPPPV <• 
71291 AGACGTCGAGCCGGCTGCCCGGCTCGAGGCGCTGGTAGTCGGTCCCCGACAGCGCGGCGTACTGACGGTTGAGGACGGCGAGGCCGTTGTCG

<VDLRSGPELRQYDTGSLAAYQRNLVALGND 

713 8 3 TTGAGCAGCCCGTCGTGCAGGGCGAAGGCTCGGCGCGGGGCGACCGCGCGGATG AAGTCGAGCACCTCGGAGAACTTCGACCAGGGGGCGTG 

<N LLGDHLAFARRPAVARI FDLVESFKSWPAH 

714 7 5 GATCGGCGCGAAGAGGGTGTCCACGGGGGCGTCCTCGGGTGCCACCAGGGCGTCGCCGGGGTGGTAGACGACGTCGTTCAGCAGATACCCGA 

< 1 PAFLTDVPADEPAVLADGPHYVVDNLLYGL 
7156 7 GGTTGTCCACGACCGGGATGTCGGGATGGATGACGGCGTGCCGACCGCCGTACGCGCGCACCGCCACGCCCGCCGCGGTGAACGCCTGCCCG 

< NDVVP I DPH I VAHRGGYA RVAVGAATFAQG 

716 5 9 GGTGAGATGGGCTCCAGGGCTTCGGCCACGTCGCCGAGGGCACCGGCCAGCGAGGCCGGGCCGTAGATGCGGAACGGCCAGCGGTCGAGCTG 

<P S I PELAEAVDGLAGALSAPGYIRFPWRDLQ 

717 51 CCGGGTGAGAGCCGCGACGTCCACGTGGTCGGGGTGCTCATGGGTGATCAGCACCGCGTCCGCACCGTCCAGCGCGGTCGGGTCGCTGAAGA 

<RTLAAVDVHDPHEHTI LVADAGDLATPDSFV 
7184 3 CGCCCGGGTCGACGACCAGCACACCGCCGTCGTGCTCGACGCGGAGGCAGGAGTGGGCGAATTTGGTGAGCTGCATCGTGACTCCTCGAT 

<GPDVVLVGGDHEVRLCSHAFKTLQM 
71933 TGACCCAATCGTGATGTCCCTCAGCGCAGTCTGCCGGAACCGGCGCGGTGCCGCGTCGCGTCTGAGGTATCGCCCGATGGGGCGTAGACGAT 
72025 CGGAGCGGGAATGGACGGACACGCAAGACGCAGGTTGCTGACCGCGGTGGGGCTGGTGGCGCTGCTGGCGGCCGGTGGTTGCAGCGCCGACA 
72117 GCGATGGGGCGGGAGAGCCTTCGGCGGCGGTGGCGCCCGAGCAGGCGGCACCTGCCGACGGGGGCAAGGCGGAGCAGGGTGCCGATGCCGGG 

722 09 GGAACGCGGTCCGGCGCGGGTGGCGGCTCACCGGACCTGCGGGTCGACCAGCGGTCAATCATCTACACCGGAACCATGCGGGTGCGGGTGG 

> M R V R V 

723 0 0 ACGATGTGGACGCCGCCGCCCGGTCCGCCATCACGGCGGTCACCGGCGTCGGCGGCTTCGTCGGCGGCGACGAGCGCAGCAGCGGCGGAACC 

>D DVDAAARSAI TAVTGVGG FVGGDERSSGGT 

723 92 GCCGACGCCCGGGCGGAGTTGCAACTGCGGGTGCCGGCGGAGCGGTTCACGGCCGTCCTGGAGGAGTTGGCGAGGCTCGGCCGGCAGGAGCA 

>adaraelqlrvpaerf.tavleela~rlgrqeq 

724 84 gcgggcgatccgcacggaggacgtgaccgaggagaccgtggatctcgacgcgcggatcgccacccaacgggcccgggtcgagagcggtcgca 

>rairtedvteetvdldariatqrarvesgr 
72576 agctgctggcgcgggccacctcgatcggcgacctggtgacgctggagagcgaggtggctaggcgggaggccgacctcgcctcgctggaggcg 

>kllaratsigdlvtlesevarreadlaslea 
72668 aagaagcgccggctggccgacctgacctcgctctccaccatcaccctcaccctggtcggtccggaggcggaggcccgggacaccgagcccga 

>kkrrladltslstitltlvgpeaeardtepd 
72760 caccggcttcgtggtcggcctgcgcggcggctggacggcgttcgtcgcctcgctgggcgtgctgctcaccgtgctcggggcgctgctgccgt 

>tgfvvglrggwtafvaslgvlltvlgallp 
72852 tcgcggtggccctcggcgtgccggtggcggtgctgcttgcggtgctgcggcggcgccggcgtcgaccgccggcgccggccgtcaacgcgcc 

>favalgvpvavllavlrrrrrrppapavnap 

< o R A 

7294 3 gccgccagtgcccgcagcgcggtctgcaccatgacccggatgccgaccgcgatggccccctcgtcgacgtcgaacgaggcccggtgcaggt 

>PPVPAARSAPo 

<AALARLATQVMVRIGVAIAGEDVDFSARHLD 
73 03 4 CGACGTTCGGGCCGGACCGGCCGACGCCGAGGCGGGCCAGCGCGCCGGGGACGTACTCCAGGTACCAGGAGAAGTCCTCGCCGCCCATGCTC 

< VN PG S RGVG L RALAG PVY E L YW S FD EG G "M S 
73126 TGCGGGGTCTCCGCGACCCCCTCCGGGCCGAGCGCGGCGTGGGTCGCCGCCGTGAGCACCTGGATCGCCCGGGCGTCGTTGGTCACCGGCGG 

<Q PTEAVG EPGLAAHTAATLVQ IARADNTVP P 
73218 CCGGCCGCGTAGGTACTCCAGGTCGACGGTGGCGCCGGTGGGGGCGATGACGTCCCGCACCACCTGAGCGACGATCTTGGGGGCCTGGTCCC 

<RGRLYELDVTAGTPAIVDRVVQAVIKPAQDW 
73310 AGGTGTCGCGGTCCATCACCCGCAGGGTGCCGGACGCGGACGCCTCGGACGGGATCACGTTGTACCGGGTGCCGGCCGAGGCGTGGCCGAAC 

< TDRDMVRLTGSASAESPI VNYRTGASAHGF 
734 02 ACGAGCAGCAGCCCGCTGTTGGCCGGCACCCGGCGGCTGACCAGGGCTGGCACCTCGGTGACCAGCCGGCCGAGCGCGTCGACCAGGTCGAC 

<v lllgsnapvrrsvlapve tvlrgladvldv 
73494 ggtcaggtgcgggcgagcggtgtgcccgccggggccggtgagccggacggtgacgttgtcggcggcggcggtgatcgggccgacccgcaggc 

<tlhprathggpgtlrvtvndaaatipgvrlg 
73 58 6 cgaccttgccgacgggctggttggggtcgcagtgcagtgcgaag atctgcacgacgtcgtcgagaccgccggcctcgatgacctccagcgag 

<vkgvpqnpdchlafiqvvddlggaeivels 
73678 ccgcagggcaggatctcctcggccggctggaagatcagccggacccggccgtccaattcgccgaggttggcgagttgggccagcagcacgcc 

<gcplie eapqfilrvrgdleglnalqallvg 

73 7 70 GACGCCGAGCAGCACGGTGGTGTGCACGTCGTGGCCGCAGGCGTGGCAGACACCGTCCTTGGTGGACCGGTAGGGCACGTCCTTGACGTCGG 
<VGLLVTTHVDHGCAHCVGDKTSRYPVDKVDT 

73 86 2 TCAGCGGCAGCGCGTCGATGTCGGCGCGGAGCGCGACCACCGGGCCGTCGGGGCGGCCGTCGATGTCGCAGATGACCCCGTTGCCCTTTGGC 

<LPLADIDARLAVVPGDPRGDIDCIVGNGKP 
73954 AGCAGGCGCGGGCGCAACCCGGCGAGCGACAGCTCGCGGGCGATCAGGGCCGCCGTCTCGAACTCCTCGCCGGAGAGCTCCGGGTGGGAGTG 
<LLRPRLGALSLERAI LAAT EFEEGSLEPHSH 

74 04 6 GATGTGCCGGCGGGTGGCGATAAGGCCGGGCATCCGG AGGGCGAGCAGATGGTCGAGCTCGAAGGGCAAAGGCTGCGACCCGGACGGCGACT 

<IH,RRTAI LGPMRLALLHDLEFPLPQSGSPSE 
7413 8 CCGGCCAGGCCGACGACGCCAGGTGGCTGCCGTTCGGCAGCGTCAACGCACTCGTCACGTCGAAATCTCGATCACTAGAAACGGATGGATC 

< PWASSALHSGNPLTLASTV 

74229 ATCAGGGATGACAGCCGCCAGCCTAGACCTTCGACGGTGACTCTGTGCAACATCATTCCCGTAGCGATCGGACCGCGCAGCGTCACGAATAC 
74321 CCTGGTGGAAGGGCTCCATAATCTGCGGGACAGCAGGTAGATCGCGGTTGAACGCCGTCATCTGCCCCCCACCTCCTACAACCCGTAACCGA 
74413 TTCGGCGGTCACGAAATCACCGTCGATCCGGGGCCGTCGCTACCGAATTGTCGCATTAGTCGTCTCGGTTAACTGCCGCTCGGACAAGTAAC 
74505 CGACCGCACTCGGCAGTCGGACGACTCCCGACGGTGACACGACGGGCCGACCTGCTCCCGCACATGCTGTCCGTCCCAATCACCCGGACGGG
74597 TTACCCACTGCCCCGGGTGGCACACACCGGGCTGCCCCGGGTGGCACACACACCAGGCGACCGTAGCCCCGAACGGTTACGCCCCGCGACCC
74689 CTCATCCGAGACAAGGGGTCAGAACCGGTCGCTGGGGCGGTACGTCCCCCACACCTGGCGCAGGGTGCCACAGACCTCGCCCACCGTGGCC 

<• FRDSPRYTGWVQRLTGCVEGVTA 
74 780 CGGGCCCGCAGCGCCTCCTTCATCGGGTGCAGCACGTTCGCCGTACCCTCGGCGGCGGCCCGCAGCTCACCCAGCGCGCGCTCGACGGCGTC 

<R ARLAEKMPHLVNATGEAAARLEGLAREVAD 
74872 GCTGTCGCGCTCCACTCGCAGCTTGGCCAGCCGCTCGGCCTGGGCAGCCTCGATCGTCGGGTCCACCCGCAGCGGCTCGTACGGCTCGTCGG 

<SDREVRLKALREAQAAEI T PDVRLPEYPEDA 
74 964 CGTCGACCGTGAACCGGTTGAGGCCGACCACCACCCGCTCGCCCGAGTCGATCTCCTGGGCGATCCGGTACGCGGACTGCTCGATCTCCCGC 

< DVTFRNLGVVVREGSD IEQAI RYASQEI ER 
7505 6 TTCTGGAAGCCCGCCTCGATGGCGTCGACCACCGAGCCGTGGTCGGCCACCCGCTCGATCAGCTCCACCACCGCCGCCTCGATCTCGGCGGT 

<K QFGAE IADVVSGHDAVREMLEVVAAEI EAT 
75148 CATCGCCTCCACCACGTACGACCCGGCGAACGGGTCGACGGTGGCGGTCAGGTCCGTCTCGTACGCGAGCACCTGCTGGGTGCGCAGCGCCA
<MAEVVYSGAFPDVTATLDTEYALVQQTRLAL 
7524 0 GCCGGGCGGCCTTCTCGGTGGGCAGCGCGATGGCCTCGTCGAAGCTGTTCGTGTGTAGCGACTGGGTGCCGCCGAGCACCGCGCCCAGCCCC 

< RAAKETPLAIAEDFSNTHLSQTGGLVAGLG 
753 3 2 TGGATCGCCACCCGGACCAGGTTCACCTCGGGCTGCTGGGCGGTGAGCTGCACGCCCGCCGTCTGGGTGTGGAAGCGCAGCATCATCGACTT 

<QIAVRVLNVEPQQATLQVGATQTHFRLMMSK 
75424 CGGGTTCTTCGCGCCGAACTCGTCGCGCATCAGCCGGGCCCAGATCCGCCGGGCGGCACGGAACTTCGCGACCTCCTCCAGCAGGGTGGTCC 

<PNKAGFEDRMLRAWIRRAARFKAVEELLTTR 
7 5 516 GGGCGACGAAGAAGAACGACAGCCGGGGCGCGAAGTCGTCCACCGCCAGCCCGGCGGCGAGCGCGGCCCGGACGTACTCGACGCCGTTGGCC 

<AVFFFSLRPAFDDVALGAALAARVYEVGNA 
75608 AGCGTGAACGCGATCTCCTGCGCGGGCGACGCGCCGGCCTCGGCCATGTGGTAGCCGGAGATGGAGATGGTGTTCCACTTCGGCACCTCCGC 

<L TFAI EQA P SAGAEAMHYGS I SITNWKPVEA 
75 700 CCGGCAGTAGGCGAACGTGTCGGCGACCAGCCGCAGCGAGGGCTTCGGCGGGAAGATGTACGTGCCCCGGGCGATGTACTCCTTGAGGATGT 

<RCYAFTDAVLRLS P KPPF I YTGRAIYEKLID 
75 7 9 2 CGTTCTGGATGGTGCCGTTGAGCGCCGCGCCCGGCACCCCCGCCTCCTCGGCGACGAGCTGGTAGAGCAGCAGCAGCACCGACCCCGGGGCG 

<NQITGNLAAGPVGAEEAVLQYLLLLVSGPA 
75 8 84 TTGATCGTCATCGAGGTGGAAACCTTGTCCAACGGGATGCCGTGGAACAGCAGCCGCATGTCCTCGATCGAGTCGATGGCGACGCCGACCTT 

<N I TMS TS VKDLP IGHFLLRMDE I SD IAVGVK 

75 976 GCCG ACCTCGCCGTGGGCGATCGGGTCGTCCGAGTCGTACCCCATCTGGGTGGGCAGATCAAAAGCGACGGAGAGGCCCATGGTGCCGGCGC 

<GVEGHAI PDDSDYGMQTPLDFAVSLGMTGAR 
7606 8 GCAGGAGCTGGTGGTAGCGCGCGTTGCTCTCCGTCGCGGTGCCGAAGCCGGCGTACTGGCGCATCGTCCACGGCCGCGAGGTGTACATGGTG 

< LLQHYRAN SETATGFGAYQRMTWPRSTYMT 
7616 0 GAGTAGACCCCACGGGTGTACGGGAACTCCCCCGGCTCGCCCAGCCGCTCGGGCAGACCCTCCGGCAGGTCCCTCTGGGTGTAGACACCCTT 

<SYVGRTYPFEGPEGLREPLGEPLDRQTYVGK 
76252 GATCGGGAAGCCGGACTCGCTCGACCGCGGTTCACTCATCCCCGGATGGTAGGACGTGCCACCGCGCCGGAGGGTGAGGGATTGCGCACAT 

<IPFGSESSRPESM 
76343 CGCACCCCTGTCTTTCCCCGCCGACTCCGAGGGTGAACACCTGGCCACGTTCGCTCCGATTAGGTAAACGTTCCGCCGCGTCGGGTTTCGCA
764 3 5 TCGGGCGTCGGAACCAGCAAGATAGAGGAGTTGTGTCCCAGCCCCCTCGATTTCCCCCGGTGGCTCTTCTGTGACTCAGATCCCGACGTGGA 
7652 7 GCGGCGGACCAGTCAGCCCACCCCACGGACGTGCGGCAGCCGGCACCACCATCGGTGACCGGTACTCGCTCCGGTCCGCGGTGGGCAATGGC 
76619 GGCATGGGCACGGTCTGGCGTGCCACAGACACCCTGCTGCGGCGCGACGTGGCGGTGAAGGAGGTCGTCCTCCCGCCCGGCCTCGCCCCGA 
>MGTVWR ATDTLLRRDVAVKEVVLPPGIiAP 

76 710 GCG ACCGCGACGCCATGTACGAACGCACGCTGCGCG AGGCCCGCGCGGCGGCGGCCATCCAGCACCCGGCCGTGGTCCAGGTGTACGACGTG 

>SDRDAMYERTLREARAAAAIQHPAVVQVYDV 
76802 GTCACCGAGGGTGGTCGCCCCTGGATCGTGATGGAGCTGCTGGACGCCCGCAGCCTGGCCGACATGGTGATCGAGGACGGGCCGGTGGCCCC

> V T E G G R PW I VMELLDARS LADMVI EDG PVAP 
768 94 CCGCGCGGTCGCCAAGATCGGCATCGCCCTGCTCGGCGCGCTGGAGGTGGCCCACGCGATCGGGGTGCTGCACCGCGACGTGAAGCCGGCCA 

>RAVAKI G I ALLGALEVAHAI GVLHRDVKPA 

76 98 6 ACGTGCTGATCTGCACCGACGGCCGGTGCGTGCTGACCGACTTCGGGGTGGCCAAGCTCCCCACCGACGTGCAGCTCACCACGCCGGGGATG 

>NVLICTDGRCVLTDFGVAKLPTDVQLTTPG~M 

77 07 8 GTGCTCGGCTCGCCGCACTTCATCTCCCCCGAGCGGGCCATGGGCCAGGAGTTCGGCCCGCCGAGCGACCTGTTCTCCCTCGGCGTCACGCT 

>VLGS PHF I S PERAMGQEFG PP SDLFSLGVTL 
77170 CTACACGGCGGTGGAGGGCCGGCCCCCGTTCGACAGGGGCGACCCGATCGAGACCATGCACGCCGTGGTCGAGGACCCGCCGGCCACGCCGC

>YTAVEGRP PFDRGDPIETMHAVVEDPPATP 
77262 AGCGCAGCGGCCCGCTGACCCGGGTGCTGATGGGGCTGCTGGAGAAGGACCCGGCCCGCCGCCTCGACGTGCACACCGCCCGCGCGATGCTC 

>QRSGPLTRVLMGLLEKDPARRLDVHTARAML 
77354 CGCGAGCTGCTCGCCGGCCCGCTGACCAGCACCGCCACCGCCGTCAACTCGGTCACCGACCCGTACGCGGTGGTGCCGGTCAAGCAGCGCCC 

>RELLAGPLTS TATAVNSVTDPYAVVPVKQRP 
77446 GGCCGTCGCCCCACCGCCCTCCGCTGCGGAGCCGAAGCCGAGCGGGCAGATCGGCGGCCGGGCGATGCTCGCCCCCGGCGAGTCGCTGACCG 

>AVAPPPSAAEPKPSGQIGGRAMLAPGESLT 
77538 ACCGGCTGGCGGCCCTGCGCCGGGGCGAGAAGACGAGGAAGAGGAAGACGACGACCGCCGCCGCGCTGGACGACACCAGCGCCGACGCGCTT

>DRLAALRRGEKTRKRKTTTAAALDDTSADAL 
77630 GCCGGCCCGCTGCACACCCCCACCGGGGCGATGCCCGCGCCCCCGCCCGCCGGCCGGACGTACGGCGGTTCGTCGGAGGCCACCCAGCGGGT 

>AG PLHT P TGAMPAPPPAGRTYGGS SEATQRV 
77722 CGACGCGGGGACCGCGCCGGAGGCCACCCAGCGGATGACGTACGGCAGCCCGCCCGACGCCACCCAGCGGGTGTCCCACGGGAGCGGCCCGT

>DAGTAPEATQRMTYGSPPDAT QRVSHGSGP 
77814 CGGAGGCCACCCAGCGGGTGCCCTACGGCGGCGGCTCGGCGGACGCCACCCAGCAGGTGCCCTTCGGTCGCCGGCCCGACGCGACGCAGCGG 

>SEATQRVPYGGGSADATQQVPFGRRPDATQR 
77906 GTCCCCTACGGCAGCCAGCCCGGCGCGACGCAGCCGGTCCCCGGCTTCGGCGCGTCGCCGGACGCCACCCAGCGGGTCGGCGGGGCGTACGG 

> V P Y G S Q PGATQ PVPGFGAS PDATQRVGGAYG 
7799 8 CGGCGGCCAGTGGTCGGTGCCCGGCACCGGCCAGCCGTGGGCCACCCCGGCCACCGCGCCCGCCCCGGCCACCGCCGGCGGCGGCGGGGTCG 

>GGQWSVPGTGQPWATPATAPAPATAGGGGV 

78 0 90 GCCGCCTCGTCGCCACGGTCAAGGGCTGGCCGCGCAAGGTGCAGCTCGCCGCGGCCGGCGGCGTCGCCGTGCTGCTGCTGATCGGCGTGTTC 

>GRLVATVKGWPRKVQLAAAGGVAVLLLIGVF 
78182 GCCCTCTTCGGCGGCGACGACCCGGAGCAGCCGACCACCCCGCAGGGGCAGCCGAGCGCCGGGGCGCCCGCCGGCCCCGGGGTGGAGATGCA 

>ALFGGDD PEQPTT PQGQP SAGAPAGPGVEMQ 
7 82 74 GGAGCAGTCGGCCAAGGGCGTCACGGTGCAGGTGCCCAAGGGCTGGGAGCGGCGCAGTGCCGACGGGGGCGTGTGGGTCGACTACATCGATC 

> EQSAKGVTVQVPKGWERR SADGGVWVDYID 
78366 CGGAGGACAACAGCCGCAAGGTGCGCATCCTCGCCGAGCGGTGGAGCGGCACGTCGACGCGCTGGGCCGAGACCGCCGCGAACGGGCTGCGG 

>PEDNSRKVRILAERWSGTSTRWAETAANGLR 
7 84 58 ACCCGGTCGGCCTCCTGCCAGAAGCCGTACAACCAGGTGTCGATGACCGAGCAGGAGCTCGACGGCAAGGCGGCGGCCGAGTTCGAGTACAC 

>TRSASCQKPYNQVSMTEQELDGKAAAEFEYT 
78550 CTGCGGCGACGGCGAGGGCAAGCGGCACGGCGTGTGGCGCGGGGTGGTGCACGAGGGCAAGGTCTACTCGTTCTACCTCTCCTCGACCGACG

>'CGDGEGKRHGVWRGVVHE G KVYS FYLS STD 
7 864 2 CCCGCTTCGCCGAGAGCAAGCCGATCTTCGATCAGATGGTGGCGTCGTTCAAGCTCCGCGGGAGCGACTGAGCCGGGCCGGGGGCCGACGC 

>A RFAESKPI FDQMVASFKLRGSD • 
78733 GACGCCGGCCGGGCGGGCCGACGCGACGTGGTGAGCCGCCGCCGTGCTATCAAGAGCCATGGCGGCGGACACCACTGACCTCGACGACACG

>MAADTTDLDDT 
7 8824 CGCGATCTGGACGACCTTCGCGACCGGGCCCGGCGGTGGCTCGACGACGACCCCGACCCGGCCACCCGCGACGAGCTGGAGGCCGTGCTCGA 

>RDLDDLRDRARRWLDDDPDPATRDELEAVLD 
78916 CGGGCTGCCGGCGAGCGCGGCCGAGCTGGCCGACCGGTTCGCCGGGCCACTGACCTTCGGCACCGCCGGGCTGCGCGGCCCGCTGCGCGCCG

>GLPASAAELADRFAGPLT FGTAGLRG P L R A 
7900 8 GCCCCAACGGGATGAACCTCGCCGTGGTCACCCAGGCCGCCGCCGGGCTCGTCGCCTGGCTCGCCGCCCAGGACGCCACCGGGCCGCTGGTC 

>GPNGMNLAVVTQAAAGLVAWLAAQDATGPLV 
79100 ATCGGGTACGACGCCCGGCACGGCTCGCGGGAGTTCGCCGAGCGCACCGCCCAGGTGGCCACCGGCGCGGGCCGCCCGGCGCTGCTGCTGCC 

>IGYDARHGSR EFAERTAQVATGAGRPALLLP 
79192 CCGCCCGCTGCCCACCCCCGTGCTGGCGTACGCGGTGCGGCAGCTCGACGCGGCGGCCGGCGTGATGGTGACCGCCAGCCACAACCCGCCCC 

>RPLPTPVLAYAVRQLDAAAGVMVTASHNPP 
79284 AGGACAACGGCTACAAGGTCTACCTCGGCGCGCAGCTCGGCGGCGAGCTGGGCGCGGGGGCGCAGATCGTGCCGCCGGCCGACACCGGCATC 

>QDNGYKVYLGAQLGGELGAGAQ I VPPADTG I 
793 76 GAGGCCGCCATCCGGGCGGTCGGCCCGCTGGCCGACGTACCGCTGGGCCCGGCCGGGCAGGTCGTCGGCGACGACGTGGTCGTGTCGTACGT 

> E A A I RAVG PLADVPLGPAGQVVGDDVVVSYV 
7 9468 CGACCGGGCCGCCGCCGTGGTCGACCCGGCGGGGCCCCGGAGCCTGAAGGTGGCCTACACGCCGCTGCACGGCGTGGGCGCGGCCGTGCTGA 

>DRAAAVVD PAGPRSLKVAYTPLHGVGAAVL 
7 9560 CCGCCGCCTTCGCCCGCGCCGGCTTCGGCATCCCCGGCGTGGTGCCCGAGCAGGCGGTGCCGGACCCGGACTTCCGGACCGTCAGCTTCCCC 

>T AAFARAGFGI PGVVPEQAVPD PDFRTVS F P 
7 9652 AACCCGGAGGAGCCGGGGGCGGTGGACCTCCTCGTCGCGCTCGCCGAGCGCACCGGGGCGGACCTGGCGATCGCCAACGACCCCGACGCGGA 

>N PEE PGAVDLLVALAERTGADLA IAND PDAD 
7 974 4 CCGCTGCGCGGTGGCCGTCCGCGACGGCCGGGCGGCGGGCCCGGCACCGGTGAGTGGGGGCGCCTGGCGGATGCTGCGCGGGG ACGAGGTGG 

>RCAVAVRDGRAAGPAPVSGGAWRMLRGDEV 
79836 GGGCGCTGCTCGCCGACCATCTCATGCGCCGTGGCGTCCACGGCCTGTACGCCACCACCATCGTGTCGTCGTCCCTGCTACGGGCCATGTGC

>GALLADHLMRRGVHGLYATTIVSSSLLRAMC 

7 992 8 GCCGCCCGTGGCCTGCCGTACGACGAGACGCTGACCGGCTTCAAGTGGATCGTCCGGGCCGGCGGCGGACCGCTGGGTGAGGCCGGCTCCGA 

>AARGLPYDETLTGFKWIVRAGGGPLGEAGSD 

80020 CCCGCTGGTCTTCGGCTACGAGGAGGCGCTGGGCTACTGCGTCGCCCCGGAGCACGTCCGCGACAAGGACGGCATCACCGCCGCGCTGACCG

> PliVFGYEEALGYCVAPEHVRDKDGITAALT 
80112 TCGCCGAGCTGGCCGCCGGGCTGAAGGCGCAGGGGCGCACCCTCACCGACCGGCTCGACGAGCTGGCCGCCGAGTTCGGCGTGCACCACACC 

>V AELAAGLKAQGRTLTDRLDELAAEFGVHHT 
802 04 GACCAACTCTCGGTGCGGGTGGACGACCTGCGCATCATCGCCGACGCGATGGCCCGGGTCCGGGCGGCCACCCCGACGACCCTGCTCGGCCG 
>DQLSVRVDDLRIIADAMARVRAATPTTLLGR 

802 96 CCCGGTGACCGAGGCGCGGGACCTGCTCCCCGAGGCGGACGTGGTGATCCTGCGTACCGACGGGGCACGGGTGGTGATCCGCCCGTCGGGCA 

>PVTEARDLLPEADVVILRTDGARVVIRPSG 

803 88 CCGAGCCGAAGCTCAAGGCGTACCTGGAGGTGGTGGAGCCGGTGGCGGACGGCGACGTGCCGGCGGCCCGGACGCGCGCGGCGGCCACGCTG 

>T E PKLKAYLEVVEPVADGDVPAARTRAAATL 
8 04 8 0 GCGGCACTCCGCACGGAAATCGCCGCCCTGGTGCAGGGATGAGGTGTGCTCCCGCTTCCGACGCTCTCTCAGCGGGTTGGCGTGTCGCCCC 
>AALRTE I AALVQG • 

8 05 71 CAGGTTCGTGGTGATATGCGGCCAGGTAAGCCACCGTCCTGCCACTATCCATGTCGTAGAACATGATGCGTGCCACTTGGATGTAGTAGTAG 
806 63 GTGACCAATGAGGTGAACCCGGGGCCCTGTTTGAGCGTGTACGTAGCGCCGAAGCCACTGGCAGTCACATCGTCACGGCGGGCTTTCCCGTT 
8075 5 AGCGCTCGCGTAGCTCAGCGCAGCAGGAACATCCTCCCCGCCATAGGGATACAGCAGCTGGCTGGTGTATCCCTTCTCGAAGGGCTTTGCAG 
80 84 7 GTTCTGCCGGCAAGGGCGAGAGGTCCTGGGCGTCCGTCTTGATCACGATCTCGTAGAGCCGGACATCGGCCGTCTCGGTGTTGTACGCCAGG 
8093 9 ATGACCGGCACCGGCAAGGTGTTTGGGAGCGTGACCGAAATGGACGTGAACCCGGGTCCGGGGCGCTGCGTCGGCCGCTGCCCGGCTTGGGT 
81031 GGCGGGCCCGTCCGAGGAATAACTACGGAACATGCAACTGCCGTCTGTCTTGTTGTACAGGAACAGCCACTGCCGCTTTCCACCCTCGCCGT 
81123 TCACCTCGAAGGGCGCTACGGACGACCATCCAGCGCCCCAGTTGTCCTTCTGGCCATCAGGCGGCCACACCTGCTCGCAACCGTCTTCGTTG 
81215 TCCACAACCTGCCACACGGCGCTGTTGCCGGTAAGGGCATCGTAGAGCTGGACATACCGCCTGTCGTTCTGCACGACGGGAACGACGGAGGT 
813 07 CCACCCCTTCCCGAAATTGCGCTGCCAGCGGGGAGGCCACTCGGGTGGCATCTCGTCGTGGGCATTGACTGGACCGAAGGCGGCCGTGCCGT 

81399 CATTGCTGTTGTAGAACAGCACCGCTGGGCCACTGTAGTGCCCGTCGAAATGCAGAGATATGACGTGCGTGAACCCTGGTCCGAGTTGGACA

814 91 TCATAGAGAGCATTCCGGAGTGACATGGGTTTACCCCGTCCTAGGGATCACTGCTTCTCAACAAGATCATCAACGGTGTGCGGCAGGCTGCA 

815 8 3 CAATCGGGTGGAAGAAGAGCCAGCACCGGGCCTCACTGGCCATCACAGTATCGTCTGACCTCTTCGACCGGGCCCGAAACGGGCGAACCGAG 
81675 AGCGTTCCGCGCATCTGTCGGTCGGGCGTGTCGAAGGTGCGGCTGTTCGCCACGTGCGCGGCGAGCAGGGTGCCCTCCGCGCCGTCACGGGT 
81767 TCTGCCACGAGGCGGCCAGGACCCAGACGGTGACCAGCCGGCCCAGGGTGCCGGTCACGATGAACCGCTTGCGACCGCTGGCTGCGCCGCTC 
818 5 9 CGGCGGCCCTCCGCGCCGGACCGGGAACGCGGCACGCTCACCCCGCGTGATCCCTTGCCCACGAAGGGCTCGTCTGAATAGCATCAGCACAT 
81951 CAGGCCCCACCATTACCCGGCACCATTGGCATTCGCTTTCTTCGACGTTGCGGCTGCGGTGCCCGGGCGTTGCATTTCCGGCGGAAAGGTGT 
82043 TGCACGGTGCGTGAAACAAATGGCTCGACCGTGCCGCGACGGCAGCTCGGCAGGCTGCTCACCCAGCTCCGCGAGAGCGCCCATATCAGCA 

>VPRRQLGRLLTQLRESA HIS 
82134 TCGACGCGGCGGCCGGCGAGCTGGACTGCTCGCGGCAGAAGCTCTGGCGGATCGAGCGGGGGCTGACCTCGGCCAAGACACCGGACGTCCGG 

>I DAAAGE LDCSRQKLWRI E RGL TSAKT PDVR 
82226 GTGCTCTGCGAGCTGTACCGGGCCACGCCCGACCAGGCGAGCGTGCTGCTCGGGCTCGCCGAGGTGAGCCGGGCCGAGGGGTGGTGGCACGC

>VLCELYRATPDQASVLLGLAEVSRAEGWWHA 
82318 CCACGGCAGCTCCGTGCCGGCCTGGTTCTCGCTCTACGTCGGCCTGGAGAACGTCGCGAGCAGCATTCGGCACTACAACGCGGAGCTGGTGC 

>HGSSVPAW FSLYVGLENVASS I RHYNAELV 
82410 CGGGGCTGTTGCAGACCCCCGGCTACGCCACCGCGCTCTTCGAGCACAACCGGCCCGAGCTGGGCGAGGAGGAGCGAAAGAAGGCGGTGGGC 

>P GLLQTPGYATALiFEHNRP ELGEEERKKAVG 
82 5 02 TTCCGGACTCAGCGGCAGGGGCTGCTGGCCCGGCGGCTGCCCCCGGCCCCCGAGCTGACCGTGATCCTCAGCGAGGCGGTGCTGCGCCGCCC 

>FRTQRQG LLARRLP PAPELTV I LSEAVLRRP 
82 5 94 GGTGCCGGGCCGATCGGTGATGGCCGACCAGCTCCGGCACCTGCTGGCCGTCGGCGAACGGCACAACATCACCGTACGGGTGCTGCCGCTGG 

>VPGRSVMADQLRHLLAVG ERHN ITVRVLPL 
82 6 8 6 CCGCCGGGCCGCCGCTGGCCGCCGAGGCCGGCACGTTCGTGCTGCTCGACTTCCCGCTCTCGGCGCTCGGCAGCCCGACCGAGCCGCCGACC 

>A AGPPLAAEAGTFVLLDFPLSALGS PTEPPT 
82778 GTCTACGTCGAGGGGCTCACCGGCGCGCTCTACCTCGACCAGCCGACGGAGATCGCCGCGTACGAACGGGTCTGGAGGGGTCTGGATTCGCT

>VYVEGLTGALYLDQPTEIAAYERVWRGLDSL 
82 870 CGCCCTCGGCGCGCGACAATCAGCGGAGCTGATCGATGCCATCCGGGGAGAGTGCTATGAGTGATCTGACCGGCGCCCGCTGGCGCACCAG 

>ALGARQSAELIDAIRGECYE« 

82961 CACCCGCAGCGGCACCAACGGCGGGGACTGCGTCGAGGTGGCCGACAACCTCACCGGCATCGTCGGCGTCCGGGACAGCAAGGACCCGGGCG
83053 GGCCGGCCCTGACCGTCCCGCCCGCCGCCTGGTCCGCCTTCGTCACCGAGGTCAAGGCCAACCGCCTCACCCGCTGACAGCTCCGCAACGAA 
8 314 5 CCAACCCCCGGCTACGCCTCAGACGCGGCCAGCGCCCCAGGGCTGCCCACGCCTCAGACAGCCCGCGCCTCAGACGTGCCTGCGCCTCAGAC 

83237 AGCCCGGGCCGCAGGGGCGCGTGCGCCTCAGACGGCCCGGGCCTCAGGCGCGCTTGCCGAGGGCGGCGTCGACCGCCTTGCCCAGGGCGGT

< • A R K G LAADVAKGLAT 
83 3 2 8 GACGACCAGGGCCACCGAGGGGCGGACCACCGAGTCGTCCAGGCTGACGGTGCCGGAGAAGCCGGCCCCGCCCGCGATCTCCTCCAGCCGAC 
<VVLAVS PRVVSDDL SVTGS FGAGGAI EE LRR 
83420 GCCGGGCGTCGACGGCGGCGTCGCCGGTCAGGCCGAGCGCCGGCTCCATCCGGAGCACCGCGACCAGGGTGGCCAGCGCCGCCGTCGCCTCG

< RADVAADGTLGLAPEMRLVAVLTALAATAE 
83512 TCGGGGATCTGCCCGCCGGCCAGCGCCTCGGCCAGCCGCCGCCGGGAGTCCTCCTCCACGGTGGCGTCCGCCGTCGGGTAGCGGTGCACGTG 

<D PIQGGALAEALRRRSDEEVTADATPYRHVH 
83 6 0 4 GATGAAGCCCAGCTCGGTCTCCTCGACGTCGCGCACCACGCCCCGGGCGACCAGGTCGCCGAGGATCCGGTCGCGCAGGCCGTGGCGCAGCC 

<I FGLETEEVDRVVGRAVLDGLI RDRLGHRLR 
83 6 96 GCTGCACCCAGGAGGACGGGGTGTGCGGCGTGTCGGCGGCCATCCGGCCCAGGACCTCGTCCAGGATCGGTTCGCCGGTGGGCGCGGGGTCC 

<QVWSSPTH PTDAAMRGLVEDLI PEGTPAPD 
83 78 8 GTGACCACCAGGTTCCCATCGACGTACGCGACCCGGCCGGCGAGGGCCAGCTCGATCAGGACGGCGGCGGCCATCCCGAGGTCGAGGCTGAT 

<T VVLNGDVYAVRGALALE I LVAAAMGLDLS I 
83 8 8 0 CCGCGGCATGGTCGCCTTGCCGGATTCGTCGTCGTACGCGAGGAGCAGCAATTCCTCGGCCAGCGCAACACCAGTCATGGCCGGAGACGG 

< R P M 

83 9 70 TAGCGCCTGAGCGCACCCCGTGCGCCCCCAACTCGCCCACGAGCGCACTCGCCCGGTGAGAGGGGAACCCCGCTATACCGCAGGCGTTAACA 

84 06 2 gggggcccttccttgcgatcagaagcggggcataccgccgaactggcggtcgccggcgtcgccgaggcccggcacgatgaacatccggtcg 

<o frpmggfqrdgadgl.gpvi fmrd 
84153 ttgaggctctcgtcgatcgcggcggtgacgaggcgcagcggcaggccggactgctccagccgggcgatgccgacgggcgcggcgagcacgca 

<n lsed iaatvlrlp lg sq e l ra i gvpaalvc 
8424 5 gagcacggtgatgtcggtgcagccccgctcggccagcagccggcagcagtgctccagggagccgccggtggccagcatcgggtcgaggacca 

<lvtidtcgreallrcchelsggtalmpdlvl 
84 33 7 gcacgggcaggccggcgaggtcgcgcggcagcgactccatgtacgcgcgcggctcgtacgtctcctcgtcgcgggcgaggccgacgaagccc 

< vplgaldrplsemyarpeyteedralgvfg 
84 4 2 9 atggacgactcggggagcagggcgagcgcggcgtcggccatgccgagaccggcccgcagcacgggtaccagcagcggcgggttggccagccg 

<m s se p llalaadamg l-gar lv p v l ~l p pnalr 
84 5 21 ggtgccctcggcgtcggtgacgggggtctgcaccgggtacttctcgacggggaaggagcgggcggcctcgtacacgagcatggtggtgagct 

<tgeadtvptqvpykevpfsraaeyvlmttle 
84613 cgtg cag cg cgg c c cgg aagttgg ag g ag tcgg tg cg cg cgt cc cg c atggcgg t gag c cgcg actggg cg ag cgg atggt caatg acgtgt 

< hlaarfns sdtradrm 

84 70 5 acgtccacgatcgcccaacctaccgaacgcccgggccccgagacacgaggagcgacgccgctcacccggcccgtgcggcaaccctggacggt 
84797 tgg cgttcg aaggt aggg caggtg cc c aaggt cggg c c cg acct cg ccg cg tg at caag at c acg agg cgtg cgggtg cgt ag actt ccggg 
84 8 8 9 catgacggcgacagcgacgtcggcccggtcggacctctccgagctgggacgatccgagaccgctttgcggaacttcctgcacggcctgccg 
>mtatatsarsdls elgrs etalrnflhglp 

84 98 0 ggcgtggaccaggtcggcgcggagcagcgggcggcccagctcggcacccgctcgatcaagaccacggccaaggcccgggcgatcgacctggc 

>gvdqvgaeq raaq lgtrs i kttakaraidla 

85 072 gatccggatggtcgacctgaccaccctggagggggccgacacccccggcaaggtgcgggcgctcgcggccaaagcactgcgccccgacccgg 

> irmvdlttlegadtpgkvralaakalrpdp 
85164 ccgacccgtcctgcccgcacgtcggcgcagtctgcgtctacccggcgatggtcccgtacgtggccgaggtgctgcgcggatccgccgggtcc 

>adpscphvgavcvypamvpyvaevlrgsags 
852 5 6 gggcggccgtccggcggaccggacggcaacgcgccggccggacccggcgtggtgcacctggccagcgtggccaccgcgtttccgtccgggca 

>grpsggpdgnapagpgvvhlasvatafpsgq 
8534 8 ggcacccctggaggtcaagctcgccgacacccgggccgcagtggcggctggcgcggacgagatcgacatggtgatcaaccggggcgcgttcc 

>aplevkladtraavaagade idmvinrgaf 
8544 0 tggc cggccg ct accgcg aggt ct acg acg ag at cgtgg cc accaaac agg cg tg cgggg acg c c c acctc aaggtg atcctgg aaaccgg c 

>l agryrevydeivatkqacgdahlkvi letg 
8553 2 gagctggccacgtacgacaacgtgcgccgggcgtcctggctggccatgctggccggcggcgacttcatcaagacctcgacgggcaaggttcc 

>elatydnvrraswlamlaggdf i ktstgkvp 
85624 cg tcg cgg cgaccct cccggtg acg ctggtgatgctgg agg cggt c cg cg actt c cg cg ccg c cac cggg cgg caggtcggcgtgaag ccgg 

> vaatlpvtlvmleavrdf raatgrqvgvkp 
85 716 ccggcggcatcaagaacaccaaggacgcgatcaagtacctggttatggtcaacgagaccgtcggcccggactggctggacccggactggttc 

>a g g i kntkdai kylvmvnetvgpdwldpdwf 
8580 8 cggttcggcgcgtccagcctgctcaacgacctgctcatgcagcgcaccaagctgacgaccggcgtctactccggtcccgactacttcaccct 

>rfgassllndllmqrtklttgvysgpdyftl
85 90 0 gg actg ag cgtg at ctt cgaat acg cg cc cg c ccc cg ag tc c cg ctcggtggtgg acct c aagcc ctcgtacggg ctgttcgtcg acgg 

> d • >vi feyapapesrsvvdlkpsyglfvdg 
8598 9 ggagttcgtcgacccggccgacggcggcggcttcaagtcggtcaaccccgcctccgaggaggtgctcgccgagatcgccgaggcgggcagcg 

>efvdpadgggfksvnpaseevlaeiaeags
860 81 ccgacgtggaccgggcggtccgcgccgcccggacggcgtacgagaaggtgtggggcccgatgccgggccgggaccgggccaagtacctgttc 

>advdravraartayekvwgpmpgrdrakylf 
8617 3 cggatcgcccggatcatccaggagcgctcccgcgagctggccgtgctggagtccctggacaacggcaaaccgatccgggagtcccgggacgt 

>riar i iqersrelavles ldngkp i resrdv 
86265 cgacctgccgctggtcgccgcgcacttcttctactacgcgggctgggcagacaagctgccgtacgcgggcttcggcccgaacccccggccgc 

>dlplvaahffyyagwadklpyagfgpnprp 
8635 7 tcggcgtggccgcgcaggtcatcccgtggaacttcccgctgctcatgctcgcctggaagatcgccccggcgctggccgccggcaacacggtg 

>l gvaaqvi pwnfpllmlawkiapalaagntv 

864 4 9 GTGCTCAAGCCGGCGGAGACCACCCCGCTGACCGCGCTGCTGTTCGCCGAGATCTGCCAGCAGGCCGAGCTGCCGGCCGGCGTGGTCAACAT 
>VLKPAETTPLTALLFAEICQQAELPAGVVNI 
86541 CGTCACCGGCGCGGGCGACACCGGCCGGGCGCTGGTCGAGCACCCGGGCGTGGACAAGGTCGCGTTCACCGGCTCGACCGAGGTCGGCAAGG

> VTGAGDTGRALVEH PGVD KVAFTGSTEVGK
86633 CCATCGCCCGGTCCGTCGCGGGCACCGGCAAGAAGGTCACCCTGGAGCTGGGCGGCAAGGCCGCGAACATCGTCTTCGACGACGCCCCGGTC 

>AIARSVAGTGKKVTLELGGKAANIVFDDAPV 
8672 5 GACCAGGCGGTCGAGGGGATCGTCAACGGCATCTTCTTCAACCAGGGGC ACGTCTGCTGCGCCGGGTCGCGGCTGCTGGTCCAGGAGTCGGT 

>DQAVEGIVNGIFFNQGHVCCAGSRLLVQESV 
86817 CGC CG AGCAGGTGCTGG AGTCG CTG AAG CGC CG AATGGCGCTG CTG CGGGTCGG CG ACCCGTTGG ACAAG AACACCG AC ATCGGGG CG ATC A 

>AEQVLESLKRRMALLRVGD PLDKNTDIGAI 
86909 ACTCGGCCGCCCAGCTCGCCCGCATCCGCGAGCTGTCCGCCGCGGGCGAGGCCGAGGGGGCGGAGCGCTGGTCGCCGCCGTGCGAGCTGCCC 

>N SAAQLARI RELSAAGEAEGAERWSPPCELP 
8 7001 GAGCGCGGGTTCTGGTTCGCGCCGACGATCTTCACGGGGGTCACCCAGGCGCACCGGATCGCCCGGGAGGAGATCTTCGGTCCGGTGCTGTC 

>ERGFWF APT I FTGVTQAHR I AREEI FGPVLS 
8 7093 CGTGCTGACCTTCCGCACCCCGGCCGAGGCCGTCGAGAAGGCCAACAACACGCCGTACGGGCTGTCGGCCGGGATCTGGACCGACAAGGGCT 

>VLTFRTPAEAVEKANNTPYGLSAGIWTDKG 
87185 CCCGGATCCTGTGGATGGCCGACCGGCTGCGCGCCGGGGTGGTGTGGGCCAACACGTTCAACAAGTTCGACCCGACCTCGCCGTTCGGCGGG

>SRILWMADRLRAGVVWANTFNKFDPTSPFGG 
87277 TACAAGGAGTCGGGCTACGGTCGCGAGGGCGGCCGGCACGGGCTGGAGGGGTACCTCGGTGTCTGAGCGGGTCGCGGTACGCAAGACGTAC

>YKESGYGREGGRHGLEGYLGV • 
87368 AAGCTCTTCATCGGCGGGAAGTTCCCGCGCAGCGAGTCGGGACGGTCGTATCTCGTGCAATCCGCGAACGTGTCGCTGGCCTCCCGCAAG

>VQSANVS LASRK 
87455 GACGCGCGGGACGCCGTGGTCGCCGCCCGCGCCGCCGTGAAGGGCTGGGCCGGGGCGACCGCGTACAACCGGGGTCAGATCCTCTACCGGGT 

>DARDAVVAARAAVKGWAGATAYNRGQ I LYRV 
87550 CGCCGAGATGCTGGAGGGCCGCCGCGAGCAGTTCGTCGCGCTCGGCGTGCCGGCCGACGAGGTCGACGCGGCGATCGACCGCTGGGTCTGGT

>AEMLEGRREQFVALGVPADEVDAAIDRWVW 
876 4 2 ACGCGGGCTGGTCCGACAAGCTCCCCCAGGTGTACGGCGGTGCGAACCCTGTCGCCGGGCCGTACTTCAACCTGTCCGCGCCCGAGCCGACG 

>Y AGWSDKLPQVYGGANPVAGPYFNLSAPE PT 
8 773 4 GGGGTGGTGGCCGTGGTGGCCCCCGAGGCCCCCGCGCTGCTCGGCCTGGTCAGCGTGATCGCCCCGGCGATCGTCACCGGCAACACGGTGGT 

>GVVAVVAPEAPALLGLVSVIAPAIVTGNTVV 
87826 GGTGGCGGCCTCGCCGACCCAGCCCCTGGCCTCGGTGACCCTGGCCGAGGTGCTGGCCACCTCCGACCTGCCCGGCGGGGTGGTCAACGTCC

>VAASPTQPLASVTLAEVLATSDLPGGVVNV 
8 7915 TGACCGGTGCGATCACCGAGACGGTGCCGACGCTCGCGGCGCACCTGGACGTCAAGGCGATCGACCTGACCGGGGTGGGCGACGCGTCGCTC 

>L T G A I TETVPTLAAHLDVNAI DL T GVGDAS L 
8 8010 GCCACCGAGCTGGAGGTCAGGGCGGCGGAGAACCTCAAGCGGGTGATTCGGCCGGCCCCGGCCGACCACGACTGGTACGCCGACCCGGGCCT 

>ATELEVRAAENLKRVIRPAPADHDWYADPGL 
8 810 2 CACCCGGATGACGACGCTGCTGGAGACGAAGACGGTCTGGCACCCCAAGGGCGTCTGAGCCCACCGACCCGTCCACCGACCCGCCCGCCAC 

>TRMTTLL ETKTVWHPKGV • 
8 8193 CCGGCCGCCGGAGGCAGGGGTGGGCGGCGGGTGGGGTGGATCTACTACGAGGGGTAGGATTGCCGCGTGACT CGGTTGGGTGATCTTGAGC 

882 8 4 GGGCGGTGATGGACGTGCTGTGGGACACCGTCCCGGGCACGTCGGACGGGGTGACGGTGCGCGAGGTCGCCGAGGCCCTCGACGGCCGCGA 

>MDVLWDTVPGTSDGVTVREVAEALDGRE 

883 75 G CTGGCGT ACACG ACGGTG ATG AC CG TG CTGG ACCGG CTCG C CGG CAAGGGC ATGGTG CGG CG CC AGCGGGAGGGGCGGG CCTGG CG CT AC C 

> LAYTTVMTVLDRLAGKGMVRRQREGRAWRY 
88467 AGGCCGCGGCCAGCCGCGAGGCGCACATCGCCCAGCTCATGCTCGACGCGCTGGACCTCGGCGGCAGCCGGGACGCCGCGCTGGTGCGCTTC 

>Q AAASREAH IAQLMLDALDLGGSRDAALVRF 
885 5 9 GCCCGGTCGGTGACCGGCACCGAGGCCGAGGTGCTGCGCGCCGCCCTCGGCGCCGAGGCGGGCGGCCCGCTGACCGACCGCGTCGACGCGCC 

>ARS VTGT EA EVLRAALGAEAGG PLTDRVDA P 
88651 GCGCGCCGACCGGGCCGGGCAGCCGGCCCTGGCCGACGAGGCGACGGACCGGTAGGGCCGCCGTCATGGCGTACGCCGTGCACTTCGCCG 

>RADRAGQPALADEATDRo >MAYAVHFA 
88 7 41 CGACGGTCCTGGCCTGCTACCTGACCGCTCAGGTCCTGGCGGCGTCCACCTGGACGTGGCGGGCCCCCCGGATCGCGATCGTCTGCTGGCAG 

>A TVLACYLTAQVLAASTWTWRAPRIAIVCWQ 
888 3 3 GCGGTCGGGCTCGCGCTCGGGCTCTCCGCGATGGGCCTGCCGATGGCGCTCGGCGTGGCCGCGTACGACCGGCCGACCGGCAGCGCGTTGCT 

>AVGLALGLSAMGLPMALGVAAYDRPTGSALL 
88 92 3 CGCCCTGGCCACCGACCTGACCCACGGCACCCTGCCGGCCGGGCTCGGCGCGGTCCACCTCGGTCTGGTCGGGGTCGGGTTCGGCATCGGGG 

> A LA T D L T H G T L P AG LGAVH L G L VGVG F GIG 

8 9017 CGGCGCTGCTCGCCACGACGGTACGCAGCGTGCAGGCGACCGTCCGGGCCCAGCGGCAGCACCGGGACCTGCTCGCCCTGGTGGCCCGGCGG 

>A ALLATTVRSVQATVRAQRQHRDLLALV ARR 
8 910 9 G AC CCGG AGG TG CCGGGGG CGCTGG TG CTGGAC CATCCG AG CGCGG CGG CGT ACTG C CTGCCGGG CG TGCGG CC CCGGGTGGTGGT CAG CGC 

>DPEVPGALVLDHPSAAAYCLPGVRPRVVVSA 
89201 CGGGGCGCTCAGCATGCTCGACCGGGCCGAGCTGGCGGCGGTGCTGACCCACGAGCGGGCGCACGCCCAGGAGCGCCACGACCTTGTGCTGC 

>GALSMLDRAELAAVLTHERAHAQERHDLVL 
892 93 TGCCGTTCACCGCGCTGTGCCGTGCGCTGCCCTGGTTCCGTTGGGTACGCGACGCGCACGAGCGGGTCGCCCTGCTGGTCGAGATGCGCGCC 

>L PFTALCRAL PWFRWVRDAHERVALLVEMRA
8 93 8 5 GACGACAAGGCCCGGGAGCTGCACGCCGAGGCTCCCCTCGCGGGGGCGTTGCGCCGGTTCGCCGCGGCCGGCCACCGGATCGCGCCGGCCGG 

>DDKARELHAEAPLAGALRRFAAAGHRIAPAG 
8 94 77 CACCCTCGGCCTGGGCGACCGGGACCTGGACGTCCGGGTCCAGCGGCTGCTGGTCGCCGACCGGCCGCCCCGGCTG ATCGGGGCCGCCGCGC 

>TLGLGDRDLDVRVQRLLVADRP PRLIGAAA 
89569 TGGCGGTGGCGGTCACCCTGGTCGCGCTGCCGGTCTCCCTCTTCCTGAGCTGACGCCCGACCCGGACACGTCCGACCCGGACACGCGCGAC 

>L AVAVTLVALPVSLFLS • 
89660 CGGACACGTCCGACCCGGACGCCTCGCCCGAGTTGGGCCCGTGTCCCACGGGGCCGGCTCGCCTGCCCGTTGCCGGGCCACCGACATGCGGG
8 97 5 2 GCGATAGGTAGAGAGCCTACGTGTAGTCTTCCTACGACAAGGGAGCCTACTACCGGAGGGCGGCQATGGATCAACTGCTCCTCGCCCGTC 

>MDQLLLAR 

8 984 2 TCCAGTTCGCCACGACCACCTCGCTGCACTTCCTCTTCGTCGTCGTCACGCTCGGTCTGGTCACCCTGCTCGTCGGGCTCCAGACGGCCTGG 

>L QFATTTS LHFLFVVVTLGLVTLLVGLQTAW 
8 993 4 ACGATCACCGGCAATCCCGTCCACGAGCGGCTGACCCGGTTCTGGGGTCAGCTCTACGTGATCAACTACGTGCTCGGCATCGCCACCGGCCT 

>TITGNPVHERLTRFWGQLYVINYVLGIATGL 
90026 GCTCATGGAGTTCCAGTTCGGGCTGAACTGGAGCGGCCTGTCGCGCTACGTCGGCAACGTCTTCGGCGCCCCGCTGGCGATCGAGACCCTGG 

>LMEFQFGLNWSGLSRYVGNVFGAPLAIETL 
90118 TCGCGTTCTTCCTGGAGTCCACGTTCCTCGGGATGTGGATCTTCGGCTGGCACCGGCTGCGCCGGGGCGTGCACCTCGCGCTGCTGTGGGGC

>V AFFLESTFLGMWI FGWHRL RRGVHLALLWG 
90210 GTGGCGCTGACCGCGTACGCCTCGGCGTTCTGGGTCATGGTGGCGAACGCCTGGCTGCAGAACCCGGTCGGCTACGAGGTGCGCGACGGGGT

>VALTAYASAFWVMVANAWLQN PVGY EVRDGV 

903 02 GGCCCACCTGACCGACTTCGGCGCGTTGCTGACCAATCCCACCTTCGGCCTGGCCTTCGGGCACGTGGTCGCCGCCGCCCTGCTCACCGGCG 

>AHLTDFGALLTN PTFGLAFGHVVAAALLTG 
90394 GGATGCTGATGGCGGCGGTGAGCGCCTGGCACCTGATCCGGCGCACCCCGGACCACGCGCTGTTCCGCACGTCGCTGCGGATCGGCCTGGTC 
>G MLMAAVSAWHLIRRTPDHALFRTSLRIGLV 

904 8 6 ACCGCGGCGGTCTCGATCAGCCTGGTGCAGGGCTTCGGCTTCGCCCAGTTCGGGCCGGTCGGGCAGACGCAGCCCACCAAGTTCGGCGGCGG 

>TAAVS I S LVQGFGFAQFG PVGQTQ PTKFGGG 
90578 CG CGC AG CG CG ACG CC CTGGTCG C CG AATGGACCT CCCGGTTCGGG CCCGG CG ACT AC AC CCCGC CCGTG CTGG C CG ACGTCGGG CTCGGTT 

>AQRDALVAEWTSRFGPGDYTPPVLADVGLG
906 70 TCATGATCCTGATCGGCCTCCTCCTGGGCTGTCTGTGGCTGCTGCTCCCCCTGCTCTGGCGGGACTGGTTCATCCGGCTGCGCTTCCCGCTC 

>F MILIGLLLGCLWLLLPLLWRDWFIRLRFPL 
90762 TGGCTGATCCTGCTGGCGCTGCCGCTGCCCTTCGTCGCGGTGATCCTCGGCTGGATCGCCCGTGAGGTGGGCCGCCAGCCCTGGGTCGCGTA 

>WLILLALPLPFVAVILGWIAREVGRQPWVAY
90854 CGGGCTGCTTTCCACCGAGCGGGCGGTCTCGCCGGTCGCGCCCGGGGTGATGCTCGCCTCGCTGATCGGCTTCACCCTGCTGCTCGGCGGGC

>GLLSTERAVSPVAPGVMLAS LIGFTLLLGG 
90 94 6 TCGCCGTCGCCAACTGGGTGCTGTTCGCCCGGTACGCCGCCCGGGGAGCCGCCGATCCCGCCCTAGGCCGCCGGCCCGGCCCAGCCGCCGAC 

>L AVANWVLFARYAARGAADPALGRRPGPAAD 
910 3 8 GAGTCCCGTCCCGTCCCCGTCCTCGGCTGAGGAGGCCCCTGTGGAACTCGCCTGGTACGCCCTGCTCGGGCTCTTCCTCGCCGGCTACC 

>ESRPVPVLG© >VELAWYALLGLFLAGY 
9112 7 TGGTCCTCGGCGGCTACGACTACGGCGTCGGCCTGCTGCTCGCCCGGGGCGGCCCGCCGGCCCGCCGCCGGGCGGCCCTCACCGCCGTGGGC 

>L VLGGYDYGVGLLLARGG P PARRRAALTAVG 
91219 CCGTTCTTCCTCGGCAACGAGGTCTGGCTGGTGGCGACCGTCGGCATTCTGTTCGGCGCGTTCCCCACCCTGGAGGGGGAACTGCTGTCCGG 

>PFFLGNEVWLVATVGILFGAFPTLEGELLSG 
91311 CTTCTACCCCGTCGTCGCCGCCGCGCTGGCCGGGGTGATCATGGTGACCGTCGGCGTGCAACTGCGCAGCCGCCCGACCGACGAGCCGACCC 

> FYPVVAAALAGVIMVTVGVQLRSRPTDEPT 
914 0 3 GCGCCGCCTGGGACCGGATGGTGGCCGCCGGGAGCCTGCTCGCCGCGTTCGGCTGGGGGGCGCTGCTCGCCGGGCTGCTCCAGGGCGTACCG 

>R AAWDRMVAAGSLLAAFGWGALLAGLLQGVP 

914 9 5 CTGGCCGCCGACGGGCACGTCACGGGCGTCGGCCACGTGGCCACCCCGTTCGCGGCCCTCGCCGGGCTGGCGATGACGGCCCTGGTGGCGGT 

>LAADGHVTGVGHVATPFAALAGLAM TALVAV 

915 8 7 GCACGGTGCGACGTTCCTCACGCTCCGGCTGTCGGCCGCCGACGCCGCACCGCTGGCCCGTACCGCCCGCCGGCTGGTCGCGGTGGCGCTCG 

>HGATFLTLRLSAADAAPLARTARRLVAVAL 
9167 9 CCGCCGTCGCCCTGGCCGCCGTCGCGGGCGCGCTCTCCGATCGGGTACGCGCCGCGACGCAGCGCCCGCTGCCGGCCGTACTGCTGCCGTTG 

>A AVALAAVAGALSDRVRAATQRPLPAVLL PL 
91771 GTACTGGTGGCGGCGCTGCTGGTGGCCCGGGCGGCGCACGCGCGGCACCTGCCCGGGGTGGCCTTCGCCGCCACTTCGGCGGCGCTGGCGCT 

>VLVAALLVARAAHARHLPGVAFAATSAALAL 
918 6 3 GCCGGTGGCGGGAGTCGGCGCGGCGTTGTGGCCCTACGCGCTGGTCTCCACCGTCGCACCGACGGCATCACTGAGCGTGACCGACGCGGCGG 

> PVAGVGAA LWPYALVS TVA PTAS LSVTDAA 
9195 5 CCAGCGGGCCGACGCTGACGGTGCTGGGCTGGCTGGCGCTACCGCTCCTGCCGGCCCTACTAGGCTTCCAGGCGATGTGCTGGTGGGTGTTC 

>A S G P TL T VLGWLAL P LLPALLG F QAMCWWVF 
92 04 7 CGGGGACGAACCGACGGCAGGGCACCGGTGTACTGGTGAGCCGCCGTCCCTTCGACCCACGTCTGCTCCGCCGGGTCCCCGCGGCCCGGCG 

>RG RTDGRA PVYW o 
9213 8 CG AC CTCGC CG TG CT CG CGGTG CT CGG CGGG CTG ACGG CG CTG CTGGT CG TGGGG C AGG CCACCGCG CTGG CC ACGGTG CTGGCCG CCG CG 

>VLAVLGGLTALLVVGQ ATALATVLAAA 
92229 CTCGACGGGCGGTTGGCCCGGCCGGCGCTCGCCGGTTTCCTGGCCGCCGTGGTGGGGCGGGCGCTGGTCGCCTGGGCCCAGGGCACGGTGGC 

>LD GRLAR PALAG FLAAVVG RALVAWAQGTVA 
923 21 GGCGCGGGCCGCCGCGACGGTCAAGGCGGCGCTGCGGGCCGACCTGCTCGCCGCCGTCGGCCGGCACGGTCCCGGCTGGGTCGCCGGGCAGC 

>ARAAATVKAALRADLLAAVGRHGPGWVAGQ 
92413 GGGCCGGGCAGCTCGCCACCCTGGCCGGGCGGGGGCTGGACGCCCTGGACGCCTACTTCACCGGGTACCTTCCGCAGCTCGTGCTCAGCGTC 

>R AGQLATLAGRGLDALDAYFTGYLPQLVLSV
92 505 ACCGTCCCGGTGGCCGTGCTGGCCCGGATCACCTTCGCCGACTGGGGCTCGGCCGTCATCGTCGCGCTGACCCTGCCGCTGATCCCGGTCTT 

>TVPVAVLARITFADWGSAVIVALTLPLIPVF 
925 97 CGGGGCGCTGCTCGGCTGGCAGGCGCAGGCCGCCACCGAGCGGCAGTGGCGGCGGCTGTCGACGCTCGGCGGGCACTTCCTCGACATGGTCG 

>GALLGWQAQAATERQWRRLSTLGGHFLDMV 
9268 9 CCGG C CTG C CC ACG CTG CG CGCGTTCGG C CGGGCCCGGGG C GAGGTCG AGGTGG TCCG C CGG ATGG C CG ACGGG C ACCG CGCCG CG ACG ATG 

>A GL PTLRAFGRARGQVEVVRRMADGHRAATM 
92781 CG CACGCTG CGG AT CG CGTTC CTGT CCGCG CTGGTGCTGGAG CTGGTCG C CACC CTGT CGGTGGCGCTGGT CG CGGTG CCGGTGGG CATCCG 

>RTLRIAFLSALVLELVATLSVALVAVPVGIR 
92 8 73 GCTGCTCGGCGGCGGGCTGGCGCTGTCCACCGCGCTGCTGGTGCTGCTGCTCACCCCGGAGGCGTACCTGCCGCTGCGGGCGGCCGGCAGCC 

>LLGGGLALSTALLVLLLTPEAYLPLRAAGS 

92 965 GGTT C C ACG CCAG C ATGG AGGGG CTGGCCG CG CTGG ACGAGGCACTG ACCCTCTC CG C CG CCG ACCCG ACCGC C ACGGCCACCGCCGGGTCG 

>R FHASMEGLAALDEALTLSAAD PTATATAGS 

93 05 7 CGG CCCGTC CC CGACGGGCG CGCCG AGATC CGGTTCGAGGG CGTGACCGTCG CGT ACG AG CGG ACCGTGG CGCT ACGGGACGTCACGCTG AC 

>RPVPDGRAEIRFEGV TVAYERTVALRDVTLT 
9314 9 AATCCGGCCCGGCGAGCGGATCGCGATCGTCGGGCCGAGCGGCGCGGGCAAGAGCACCCTGCTCAACCTGCTGCTCGGCTTCGTCGCCCCGA 

> IRPGERIAIVGPSGAGKSTLLNLLLGFVAP 
93241 CGCAGGGCCGGGTCACCGTGGGTGGCGTCGACCTGGCCGGCGCGGACCCGGACGGCTGGCGGCGTCAGGTCGCCTGGGTGCCGCAACGGGCC 

>T QGRVTVGGVDLAGAD PDGWRRQVAWVPQRA 
9333 3 CACCTCTTCGCCGCCTCGCTGACCGACAACATCCGGCTCGGTGCCCCCGGCACGCCCGACGCCGCGCTCGCCGGCGCGGTCGCCGCCGCCGC 

>HLFAASLTDNIRLGAPGTPDAALAGAVAAAA 
934 2 5 GCTGGACGAGGTGGTCGCCGCCCTGCCCGACGGGCTCGACACCGTGCTCGGTGAGCGCGGGCACGGCCTGTCCAGCGGCCAGCGGCAGCGGG 

> LDEVVAALPDGLDTVLGERGHGLSSGQRQR 
93517 TCGCCCTGGCCCGGGCGTTCCTGCGGGACGCGCCGGTGGTGCTGCTGGACGAGCCGACCGCGCGGCTGGACACCGCCAGCGAGGCCGGGGTG 

>V ALARAFLRDAPVVLLDEPTARLDTASEAGV
93 6 0 9 CTGGCCGCCACCCGCCGGCTCGTCGCCGGGCGAACCGCCCTGTTGGTGGCCCACCGGCCGGCGCTGCTCTCCGACGCCGACCGGATCCTGCG 

>LAATRRLVAGRTALLVAHR PALLSDADRI LR 
93 7 01 GGTCGAGGAAGGCCGGGTCACCGAGCTGACCACCACCCCGGCCACAGGGGTGACCCCCGGCCCCGGCGAGGCGGCTGCCGGACCGGCCGGGC 

>VEEGRVTELTTTPATGVTPGPGEAAAGPAG 
93 7 93 AGGTCGCCCCCGCCCCGGCCGGAGAGGGGGCGGCCCGATGAGCACCGGTCCCGCCGACGACGCCTTCGCCATCCCGCTGCCGGCCGACGGG 

>QVAPAPAGEGAARo 

>MSTGPADDAFAI PLPADG 

93 8 84 GCCCCGGTGGCCGGCGGCAGCGTCCGGGCCGCCGAGCGGGCCGTGCTCCGGCTGGCCCGGCCGTACCTGGGCCGGCTGGTCGGCGCGGGTCT 

>A PVAGGS V RAAE RAVLRLA R PYLGRLVGAG L 
93976 GCTCGCCGCCGCCACCGAGTTCGCCGGGCTGGCCCTGATGGCCACCGCCACCTGGCTGCTGATGAGCGCCGCCGGTCGGCCACCACTGGACC
>LAAATEFAGLALMATATWLLMSAAGRPPLD

94 0 68 GGCTCACCGTGGCGATCGTCGCGGTCCGGGCGCTGGCGATCAGCCGAGGCGTGTTCCGCTACACCGAGCGCCTCGCCGGCCACGATGCCGTG 

>R LTVAI VAVRALAI S RGVFRYTERLAGHDAV 
9416 0 CTGCGGATGATCACCGACGTCCGGGCCGGGGTCTTCGCCGCCCTGGCCGCCCGGCGCG ACGCCGCCCGGCAGCGCACCGGGGACGCGCTGAG 

>LRM I TDV RAGVFAALAARR DAARQRTGDALS 
94252 CCGGCTCGTGTCCGACGTGGAGGCCGTGCAGGACCTGCTGCTGCGGGTGCTCGTCCCGGGGGCCGCCGCCACGGTGGTCAGCGTGCTGGCCG 

> RLVSDVEAVQDLLLRVLVPGAAATVVSVLA 
94344 TGGCCGGGGCCACCACCATCTCGCTCCCCGCCGCCGGGGTGCTGGCGCTGGGCCTGCTCGTCGCCGGGGTGGCCCTGCCGCTCGCGGCCACC 

>V AGATTI SLPAAGVLALGLLVAGVALPLAAT 
94436 GCGCTGACCCGGCACGCCGCCGACCGGGTGGCCCCGCTGCGCGGCGCGCTCGCCACGGACGCCGTGGACCTTGTCCACGGCGCCGCCGACCT
>ALTRHAADRVAPLRGALATDAVDLVHGAADL 
94528 GGCCGCGTTCGGTGCCACCGGGTACGCGCTGGACGCCGCCGCCGATCGGGCCCGCCGGCTGGCCCGGCTGGAACGACGGCTCGCCGCCACCG 

>AAFGATGYALDAAADRARRLARLERRLAAT 
9462 0 GCTTCGCCGTGGACGCCGCCGGGGCGCTCGTCGCCGGGGTGACCGCCGGCACGGTGGTGGTCACCGCGCTGCGCGACGGCGTCGGCGGGGTG 

>G F AVDAAGALVAGVTAGTVVVTALRDGVGGV 
94 712 CTGGTCGGGGTGCTGGCGGTCGGTTCCCTGGCCGCCGTCGAGGTGGCGCTGGCGCTGGTCGGGGCCGCCCGGCAGCGCACCCAGCTCCGGGC 

>LVGVLAVGSLAAVEVALALVGAARQRTQLRA 
94 804 CGGGCTGGTCCGGGTGGCCGCCCTGCTGACCGCCCCGCAGGCCGACGCGCCCGCCGCCACCCCGCCCGGTGCCGCCCGTGCCGCCGCCGTCG 

>GLVRVAALLTAPQADAPAATP PGAARAAAV 
94 8 96 GTGCCGGCCCGCACGACGTGCGCTTCGACGCGGTCACCGTGCGGTACCGGGCCGGCACCGCCCCGGCCCTGGACCGGGTCACCCTGGACCTG 

>G AG PHDVR FDAVTVRYRAGTAPALDRVTLDL 
94988 CCGGCCGGCCGCCGGGTCGCCGTGGTCGGGCCGAGCGGCGCCGGCAAGAGCACCCTCGCCGCCGTCCTCACCGGCACGGTGCGACCCGAGCA 

>PAGRRVAVVGPSGAGKSTLAAVLTGTVRPEQ 
950 8 0 GGGCCGGGTCACCCTCGACGGGGCCGACCTGTCGGCGTACCCGGTCGAGGAACTGCCCCGGGCCGTCGGCGGCCTGCTCGCCGAGGCGTACG 

>GRVTLDGADLSAYPVEEL P RAVGGLLAEAY 
95172 TCTTCCACGCCACGGTCCGGGAGAACCTGCTGCTCGGCCGGCCCGCCGCCGACGAGGCGGAGCTGACCGCCGCGACCCGGGCGGCCGGCCTG 

>V FHATVRENLLLGRPAADEAE LTAATRAAGL 
95264 CTGGACTGGGTGCACGCCCAGCCGGCCGGGTGGGACACCGTGGTCGGCGAGGAGGGCGGACAGCTCTCCGGCGGCCAGCGGCAGCGCCTCGC 

>LDWVHAQPAGWDTVVGEEGGQLSGGQRQRLA 
95356 GCTGGCCCGGGCGCTGCTCGCCGCGCCCGGGGTGCTGGTGCTCGACGAGCCGACCGAGGGGCTCGACCCGTCCGCCGCCGACGCGGTGCTCG 

>LARALLAAPGVLVLDEPTEGLDPSAADAVL 
954 4 8 CCTCGGCGCTGGCGGCGACCCCCGCCGGGCACTCGGTGCTGCTGATCAGCCACCGGCTCAGCGGGCTCGCCGACCTCGACGAGATCGTGGTG 

>A SALAATPAGHSVLLI SHRLSGLADLDE I VV 
95540 CTCGACGCCGGCCGGGTGGTCCAGCGTGGCCGGCACGACGAGTTGGTCGCCGCGCCGGGCTGGTACCGGGACCAGTGGCTGCTCCAGGAGGC 

>LDAGRVVQRGRHDELVAAPGWYRDQWLLQEA 
95632 GGCCGAGCGCGGGTACCTGGCCCTGACGCCCCGCCCCTGAGCCGGCTCCGGGATTCCCCCGGACGCGTCGGCAGTCACCGCATGGCAGGCT 

>AERGYLALTPRPo 

95723 CGTCGCATGGTGCGCTGCGACGACGTACTCGTGAAGGAGCGGCTGCGCGAGTTGAGCGACCGGCTGCACGGCCCGGCACGGCTCAAGGCCG 

>MVRCDDVLVKERLRELSDRLHGPARLKA 
95814 ACCTGCTGGCCGAGGCCCGCCACGCGTTGCAGGACGCCGTCGAGGCGTACCGGGACGGCGGGCTGCCGGCGGCGGAGGCCGAGCGGCGGGCA 

>DLLAEARHALQDAVEAYRDGGLPAAEAERRA 
95906 GTGGCCGAGTTCGGCGAGCCGGCCCGGCTCGCCCCGGCGTACCAGGCGGAGCTGGCGGCCGGGTCGCTGCGCGGCCTGTCCCTGCGGGTGCT 

>VAEFGEPARLAPAYQAELAAGSLRGLSLRVL 
95 998 CGCGGTCGCCGGCGTCCTGGTCGTCGCGGGCGATCTGACCTGGCAGGGGTCGAGCTGGAGCGGCGGCCCCGGCCCGCCGGCGGCCTACCGCC 

>AVAGVLVVAGDLTWQGSSWSGGPGPPAAYR
96090 TGCTGTCCGCCTCGGTGGACGGCATCTGGCTGGGCGCGGTCGTGCTGTCGGTGGCGGGGTTGCTGCTGGTCGCCGCCTCGGCCCGGTGGGCG 

>LLSASVDG IWLGAVVLSVAG LLLVAASARWA 
96182 CACCCGGCCCTGCCCCGGCTCGCCCGGCTGACCGGTCTCGGGCTCACCGCCACGCTCGTCCTGGGCGTGGCGACCGGCGCGGCCCTGTACGC 

>HPALPRLARLTGLGLTATLVLGVATGAALYA 
96274 CTGGTCGATCGGGCTCTGGGAGGCGGCCCGCACCTGGCCGCCGATGCTCGTGGGCGCGCTGGTCTGCGGCGCGGGGTTCTTCTGGATCGGTC 

>WSIGLWEAARTWPPMLVGALVCGAGFFWIG 

junction marker 

96366 GGGCGGCCCGGTCCTGGCTGCTCTCGGCACGCCGACCGGCCGGACCGGCGTAGTCGGGTGGGCGGCGGGCGCGGTCAGGCCGGCGTGGCGG 

>R AARSWLLSARRPAGPA* <»APTAP 
964 57 GGGTGTCGCCGAGGAACTGGCCGACGGTCGCGCTGAACTCCCGCCAGCCGGCCCGTTCCCCGGCGAGGGCCCGGCGGCCGGAGTCGGTCAGC 

<TDGLFQGVTASFERWGAREGALARRGSDTL 
9654 9 TCGTAGGTGCGGCGCTCGCGGCCGTTGACGGTGCTCCAACTGCTGACCACGTGCCCCGCCCGCTCCAGCCGGCGCAGCGCCGGGTAGATCGT 

<EYTRRERGNVTSWSSVVHGARELRRLAPYIT 
96641 CCCGGTAGGCAGATCGAGGCTGCCCTCGCTGCGCGCGCGCAGCGCCTCGATGATGGCGTAGCCGTGCAGCGCGCCCCGTTCCAGCACCGCGA 

<GTPLDLSG ESRARLAEI IAYGHLAGRELVAL 
96733 GCAGCAGCGCGTCGAGGTGTCCGTGCAGCGCCTGGGCCTTCATAGGTAGCAAGACTACTTGTGGGCCACTCGCGCCGCCACCGGGGTGCGG 

<LLADLHGHLAQAKM 
96824 GCACCGGGCCTCCTAAGCCGCCCACTAGGGTATGTGCCCAGAGTCACTCGGCGGCCGGAGACGCCGCCGGGTGGGCAGCCCGAAGCACAC 

>VGSPKHT 

96914 GGAGGTCAGCGTGGCCCGCCAGTCGCCCCAACGGCCCGACGCCGACGAGCCTGAGCTCGACGAGACCGACGGCACCGCCGCAGAGGTCGAAG 

>EVSVARQS PQRPDADEPELDETDGTAAEVE 
97006 AGGACGGCGCGCGCCCGTCGGCGCAGGACGCCGACCGCGCGCTCTGGGACGAGCTGCGCATCGACCCGGTCGAGATCGCCCTGCCCGCCGGC 

>E DGARPSAQDADRALWDELR I D PVE IALPAG 
97 098 ACCGGCTACACGCTGCGGGCGTACCGGCCGGCACGGGAGTTGACCCCGACCGACGTCGCCGAGCGCGACCAGGACGACCCGTTCCTGGCCCG 

>TGYTLRAYRPARELTPTDVAERDQDDPFLAR 
97190 CCGGCAGGCGGTCGAGACCGACGAGGACGAGGACGAGGTCATCATCCTCGACGAGGAGGTGGCCGCCGAGTTCGCCGAGGCGGACGCGGAGG 

>RQAVETDEDEDEVI I LDEEVAAEFAEADAE 
97282 AGGCCGGCGGGAAGTCCCGCTCCCGCAAGCCCCGCGCCGACGCGGACTCCGACGACGCCGGAGCCGCCACAGACGCGGACGCGGAGGAGGAG 

>EAGGKSRSRKPRADADSDDAGAATDADAEEE 
97374 CCGGACTCCGACGAGGACGAGGCGGGCGACGAGGAGGTTCCGGTCTTCCTCAGCCACCGGGGCAGGCTGCTGCTGTTCAAGACGCCCGAATC 

>PDSDEDEAGDEEVPVFLSHRGRLLLFKTPES 
974 66 CCTCGTCAGCTTCGTCCGGTCCGGCGCACCCAACGACATGTCTCAACTGGACAGCTGGAATGAACTGTCCGAACGGGTGGAGCCGGCCGACA 

>LVSFVRSGAPNDMSQLDSWNELSERVEPAD 
97558 T CGT C CCG CTCG ACGAGG ACACCT ACG AG CTGG ACCTGGTCGTGG AG AACCTG CGGGGTGGG C ACG ACACCTGGG ACTCGGCG CTG CTG ATC 

>IVPLDEDTYELDLVVENLRGGHDTWDSALLI 
9765 0 GAGCCGGCGAGGTGGCCCGGGACGTCGCGTATGCCCTGCGTCTGCCCGCCGTGTTGGACATGCTCTCCGCCGGCTCCAGCCTCGACGACCTG 

>EPARWPGTSRMPCVCPPCWTCSPPAPASTTW 
9774 2 G ACG AGGCGCTG CG CG CC ACGG C CAACGGCGGG CT CGGGGGCTT CCTCGG C CGCCGG CGGCTG AGG AAAAT CGGCG CG CAG ACGGCG AGTC 

>TRRCAPRPTAGSGASSAAGG • 
97833 TCGGTTGG CG C AC CATTGTCGG C AAG ATCT CTG CGGTCGTGG ACTGG CG CG ACTG AC ACGTT C C AGGG AGCAT C AGTCTCTGGC AG AG AAAG 

BamHI
junction marker 

9792 5 ACC AGTCC CGGG AGG AGG ACGACG CTGTGG CG CTCGTG CGCG TGT ACTG CGGT CTGG CCTCGG CGG ATC CGG CCG ACCGACCGG C CT CGG CC 
98017 GGTTCGGCGCTGACGTCCGCTGTGGTCGACGACGCAGGTCGGCTGCTCCATGTCTGCGAGATCGGCGACGACCCAGCCGGCTACGCTCAGCT
9 8109 GGTCGTGCTACTCGTGGAGCGGTCGGGCGGGCCGAGCGGGGCTGCGATCGCCGCCGACAGCGACGACCACACGGTCACCTCGCTGCTGAGTG 
9 8 2 01 CCGCCGGGCGTCCACTGGCGATCGCGGACGACGACTCGGTGGACGACTTCGCCGAGCGGTTCGCCGACGACGACTCGCTGGAGGAGATGCA 

> M Q 

9 8 2 92 GTCCGCGCCGGCCGAGCGGCGGGCGGTCGGGCTGGCCCGGGCGCTCCAGGCCGGCGCGCTCTCGGCGGTCACCCTCCCGGCGCCCCGGGATC 

> SAPAERRAVGLARALQAGALSAVTLPAPRD 

9 83 84 TCGCCGGCTACAAGCAGGTCCTCTCGGCGCACGCCGCGCTCGCCAGCGGCCGGCACTCCGCCGCCGTGGCGCTGCGCGAGGTGCTGCGGGAG 

>L AGYKQVLSAHAALASGRHSAAVALREVLRE 
9 84 76 CTCTACCCGGCCGCCCTGCGCGCGTACCCGGACCCGGCCGAGCCGGTCGCCCTGGCCGTGTTGGACGCCCTGCCCGAGCCCGGCATGCTGGG 

>LY PAALRAY PD PAEPVALAVLDAL PE PGMLG 
98 5 6 8 CGGGACGATCGCCCGGGGCCGGGAGGTGTCGGTGGCGGCGGACGCCATCGCCGCCCACCTCGCGGCCGACGGGGTGGCCGACGAAGGCAAGA 

>GTIARGREVSVAADAIAAHLAADGVADEGK 
9 8 6 6 0 TCAACGATGCGGTGACCGCGCTGCGGGTCGCCATCGCCGAGACCCCCCGCCGCGCGGCGGTCAGCCGGGCGCTCACCTCCGCCGTGGCGGAG 

>I NDAVTALRVA IAETPRRAAVSRALTSAVAE 
9 8 7 52 ACGGTCCGTCAGGCGGTGGCCTCGGTGCGAGCATGCGACGCGGGCTGCGAGGCCCTGGTCGGCGCGCTCGACGCCCGGGTCACCACCCCCAC 

>TVRQAVAS VRACDAGCEALVGALDARVTT PT 
98 844 CCCGGTGCCGGGCCGGCGGGCCGCCGCCCGCCGGGGCGAGCCGGTCGCCGAGTTGCCCGGCGCCGGCCTGCGCGCACTGCGGCCCACAGAGC 

> PVPGRRAAARRGEPVAELPGAGLRALRPTE 
98 93 6 CCGAGCCGGTCCCCGGCCGGCGCAGCCGTCCCGAGCCGGTCCCCGGCGGCAGCCTGCCCGCCCAGCCCCGCCCGCTCGGCCCGCCGCCGGTC 

>PEPVPGRRSRPEPVPGGSLPAQPRPLGPPPV 
9 902 8 GCGCCGGAGCCCGTCGCCCCGCCGCCGGTCGCCCCGCGCCCGATCACCCCGGCCGCCTCGGCCACCCCGCCGGTCTCCGGCCCGCCGTCGCC 

>APEPVAPPPVAPRPITPAASATPPVSGPPSP 
9 912 0 CGAGCCGCGCCGCCTGATCGACAACCCGGCCAACCGGCCGGTCTCCGCGCCGCCGCCCCCGCCACCGGGGAT CACCCCGATCGCGCCGAGCC 

>EPRRLIDNPANRPVSAPPPPPPGITPIAPS 
99212 AG CG CG AG CGCGGGTCCGTG CCG CCGG CCG AGG C CGGTG AGCCGTTC CGG CCCACG CTG ACC AC CGC CGCG ATCC AG AACGCG CGGG CGG AG 

>Q RERGSVP PAEAGEPFRPTLTTAAIQNARAE 
9 93 04 CGGCAGCGCACCATCATCCCGCCTCGCCCCAAGACGACGGGCGAGTCCGCGCCGCCGCCCACCGGCGGCTTCAGCGCCACCGACCTGAGCGT 

> R Q R T I I PPRPKTTGESAPPPTGGFSATDLSV 
993 96 CCCGGTGCCGACCCCGCGTCCCGGCCAGGAGTCCGCTCCCCCCGGCTCGCGGGCGAACTGGCCGCTGGTCAACAACCCCGAGGACCCCGCCG 

> PVPT PRPG QE SA P PG S RANW P LVNN P ED PA 

9 94 8 8 ACAGCTCCCCGAACAATCCCGTCGCGCGGCGGCCCTTGGAGGATCGGGCGAAGCGGCAGATCGACGCGCCGACCCAGGTGGTCCCGCCGGCC 

>D SS PNNPVARRPLEDRAKRQIDAPTQVVPPA 
9 95 80 GAGGGCCGGGTCACCCCGCCCTGGCTCGCCGACGACCTGCCCCAGGAGCCACCGATGCTGCGGCTGGTCGAGCCGCCACCGCTGGCCGACCG 

> E G R V T P PWLADDLPQEPPMLRLVE P PPLADR 
9 96 72 GG C ACTG CGGG ATGGG CCGGG C C AGG CTG C CG ACC CG CG C CTGG AG CCCCCG CCG CTG CGG CTGGTCG AC CG CGG CGAGGC AG CC CG CG C CG 

>ALRDGPGQAADPRLEPPPLRLVDRGEAARA
99764 GCCGTCCCGCGCCGGAGCCCCGCCCCGAGCGGGCCCCCGCGGAACACCGGTCCCCGCTGGGTCAGCGGGTCCCGTTGGAGGAGCGGCCCGAC 

>G RPAPE PRPERAPAEHRSPLGQRVPLEERPD 
9 9856 ATGG AACATCGGACCGCCCCGCCGCAGCCGTCGCGGTCCGCGCCGATGGAGCGGCGTACCCCGCCGATCTCCGACGAGGGGGACGGCGACCT 

>MEHRTAP PQPSRSAPMERRTP P I SDEGDGDL 
9 994 8 G CTG ATCTTCG CCGC CGCC AAG T CGG C CTGG TT CGTCGGGCACGGCG A CG AG TC CG AG ATGG ACTGGT CG AG C ACCG C CG ACACCGGGTGGC 

> L I FAAAKSAWFVGHGDESEMDWS STADTGW 
10004 0 AGGCCGCCGAGCAGGCCGCCCGCCCGGCGGTGGGCGCCGATACCAAGGCCGGGTTGCCCAAGCGGGTGCCGCAGGCCAACCTGGTTCCGGGC 

>Q AAEQAARPAVGADTKAGLPKRVPQANLVPG 
10 0132 T CC C CCCTG CG CG AGG AG CGTC C C CT ACGG AT AGT C CG CG ACG CGG C C AG C CTCG C CG AG AAC ACGACCGG CTACTT C CGGGG CTGG CGTCG 

>SPLREERPLRIVRDAASLAENTTGYFRGWRR 
100224 CGGGCAGGAGATCGGCGGGTTCGCCGTCGGCGGCCGGCCGGGCCGCGAGGCGGCCGGCGGCTGGGACTTCACCCGGGACACCGGCGACCGAG 

>GQEIGGFAVGGRPGREAAGGWDFTRDTGDR 
10 0316 ACG ACG ACCGGG AGTACG AGT AC CGGT C CG CCGG CT AC CG CTC CTG ACCATCGCG CCGGGGTGG CGGG CAC CG CCGCCC CGGC CG CG CC CG 

>D DDREYEYRSAGYRS * 

1004 07 CCGTGCTGGCGTACGGCCGTAGCAACCCGACGCCCACCCGACCGACCGGTGGGACCACACCGAAGAGCAACCCTGGCCGGCCGCCTGACGTC 
10 04 99 CCGCAGGGACGGTGACGGCTACTGGCCGTCCCCCCGGGAAGGTTGCGAGGCGGTCGGGGCGCACAGGCGCTGTCAGGGCCGCTCCTGAGCCG 

1005 91 CCCTACGGAATGGGCTAGCCCTACGGAATCGAGCGCCGGCCGCCCGCCGGTGGCGTCGAGTTCGCCTCCAGCCGGTCCCCAAACAGCCGGAC 
100 6 83 GCGCCCGACGGCCCGGCGGGTGCCGGTCGTCGGGCGCGTCCCGTCGTCGATCAGGCCGGCGCGACCGCGCGCGAGCGGCGCATCGTGAGCA 

<o A PA V A R S R RM T LV 
10 07 74 CGTACTCGACCAGCGAGATCAGCACGTGCTTCGTCGACTCCCGGTTCCGGGCGTCGCAGGCCACCACCGGCACGTCGTGCGAGATCGCGAGC 

<YEVLSILVHKTSERNRADCAVVPVDHSIAL 
10 0 866 G CGTC CCGG ACGTCCTGCGGGTCGTGGT ACTG CAT CC CGTCG AAG CAGTTG ATGG CC AC CAGGTACGGC AG CCG CCG ATG CTCG AAG AAGTC 

<A DRVDQPDHYQMGDFCNIAVLYPLRRHEFFD 
10 0958 GATGGCCGCGAAGCAGTCGGCCAGCCGCCGGGTGTCGACCAGGACCACCGCGCCGATGGCGCCCCGGACCAGCTCGTCCCACATGAACCAGA 

<IAAFCDALRRTDVLVVAGIAGRVLEDWMFWF 
101050 ACCGGGTCTGGCCGGGGGTGCCGAACAGGTACAGGATCAGGTCCCGGTCGATCGAGATACGGCCGAAGTCCATCGCCACCGTGGTCGTCGTC 

<RTQGPTGFLYLI LDRDIS IRGFDMAVTTTT 
10114 2 TCGCCCGGCACCTGCCGGGTGTCGTCGACGCCCACGCCGGCGGAGGTCATGATCGCCTCGGTGGTCAGCGGCGTGATCTCCGAGACCGAGCC 

<E GPVQRTDDVGVGASTMIAETTLPTIESVSG 
1012 34 GACCAGCGTCGTCTTGCCGACGCCGAACCCACCGGCGATAACGATCTTCGCCGACGTCACGCGCCCGCTCGGGACAGGCGGGCGGTGCGACA 

<VLTTKGVG FGGAIVIKASTVRGS PVPPRHSM 
101326 TGTCAGAGCCTGCGAAGTCCACTCAGCACCCTCTCCAGCAGTTCAGTGCCCACCGCGTCGTCGGAGTCGTCCAGGATGGTCGGCTCGTGGA 

<:<°LRRLGSLVRELLETGVADDSDDLITPEHV 
101417 CTGCGACCAGGCCGTCCGTCGCCATGTCGGCGATGAGCACCCTAGCCACGCCGAGCGGGAGCTGCATCCGCGCCGCGATCTCGGCAAGCGAC 

<AVLGDTAMDAILVRAVGLPLQMRAAIEALS 
101509 TG CACGCGTCCGT CGCAC AG CG CGGCG ATGT ACTGGTGCT CTCGG CCCTGG C C ACCGTTGCT ACTGG CAGCGG CCCG AC CGCG C ACCGTCGT 

<Q VRGDCLAAIYQHERGQGGNSSAAARGRVTT 
101601 CTCG ACG AG CGCCTCCAACGCGATGTCC AG CCG AGGGCGGGTACG ACCG CGGGTG ACGG CGTACGG ACGG ACC AACGCG CCAGTCGGCTCGT 

<EVLAELAI DLRPRTRGRTVAY PRVLAGT PED 
1016 93 CACGATCCATGTCGCCGCTCACCTCCTTCGTCCCCGACACCGGCTGAACCCGGTGGAACCCGTCGTTCTTGTCCTTGCCACCCCGCCCGAC 

< RDMDGSVEKTGSV 
101784 CCATCGGCCAGCGCGTGGGTCAGCCCATCATCCCCACAGTCGTACGCGGCTGCGGGGTCAACGCGTCGCCCACCCGGTCGACCAGGAGGGC 

<• GMMGVTTRPQPTLADGVRDVLLA 
101875 CATCTCGTATCCGACCTGCCCGACGTCGCAGCTACGGGCGGCGAGCACGGCGAAGGACGAGCCGTCCGAGATGGACATCAGGAACAGGAAGC

<MEYGVQGVDCSRAALVAFS SGDS I SMLF L FG 
10196 7 CGTTGTCCATCTCGACCACGGTCTGCAGCACCGCACCGCCCTCGAAGCAGCGTGCCGCTCCCTGCGTGAGGCTGACCAGCCCGGACGCGATC 

<NDMEVVTQLVAGGEFCRAAGQTLSVLGSAI 
102 05 9 GCGGCGAGCTGGTCGGCCCGGTCACGCGGAAGGTCTCGTGACGACGCCAGGAGCAGACCGTCCGCGGAGACGGCGACCGCGTGCGCG ACACC 

<A ALQDARDRPLDRSSALLLGDASVAVAHAVG 
102151 GGGCACCCGGTCGGCGAAGTTGGCCAGCAGCCAACCGAGATCCTGCGTAGTTGTCATCCTTGTTGCTCCTTCTGCCCGCTCCCGGCCACCG 

< P V <° GQQEKQGSGAVP 

102 24 2 GGCCTGAGCCAGACTGCGAGGATTGCTGCCCACCCGGAGCTGCCTCCGGGTTGGTCGGGTTGCCGTCGGGCTCGGTACGCCCACGCTGCACG 

<GSGSQSSQQGGPAAEPNTPNGDPETRGRQV 

1023 3 4 CCTCGATGGTATGCCGAGAGCAGACCGCGGACGCCCTCCGGCGTACGGCGCTGGACCGACGTGGTGGGCTTCTCCACCCCGCCAGGCACGAG 

<G RHYAS LLGRVGEPTRRQVSTTPKEVGG PVL 

1024 2 6 TTGGGCCATCGGCACCCGCTTCGGCAGGCCCTTGCGGGTGGTCTCCGCCACCGGGACCTCGGTGGCCGCCGAGGCGGCCCGCCAGCCGTCGT 

<QAMPVRKPLGKRTTEAVPVETAASAARWGDD 
102518 CCGCGGCAGTCTGCCAGGCGTGGGCCTGCGGCGTGGGCCGCCGGCCGGCGAAGCCCTCGGCGGGACCGGGGCGGGTGCCACCGTTGGTCGGC 

<AATQWAHAQPTPRRGAFGEAPGPRTGGNTP 
102 610 GACCCGTTGTCGCGCGGCATTCCTCCGGCCATCGGGGTGTCTGCCATCGGTGCGTTAGCTGTCGTCCCGGGTGCTGGTGTCTGGACGGGCCG 

<SGNDRPMGGAMPTDAMPANGTTGPAPTQVPR 
102 7 02 GCCGGTGACGTCGACCGCGGAGAACTGTTGGGTCACGGCGGCGTTCGCCGGGCTCCCGGCGCCATTGGTCGCGCGCTGGGCGACGCCGGCCG 

<GTVDVAS FQQTVAANA PSGAGNTARQAVGAT 
102 7 94 TCTCCTCCGAACCCGAGCGGCGGGTACGGAACCAGGCCGACTCGAGCTCCCGGAAGATCGGCAGCTCCATCGTCTCGTCCGCGTACCGCTGC 

<EESGSRRTRFWASELERFI PLEMTEDAYRQ 
102886 TGCCGGTTCTGGGCCTGCACGGGCGTCGACCGCGCCGGCGTGGGCGGGGTCGTCGGCTGGGTGGCCGGCGTGCTCGGCACCTCGGTGCTCGG

<QRNQAQVPTSRAPTPP TTPC;TAPTSPVETSP 

102 978 CACCCGGGGCAGCTCCGTGGTCATGTCCAGGGCTGCGGCGAGGCGCTCCGGCACCGGCGGGGTGACCGGCTCCGGCGCGGCCACCGGCGGCC 

<VRPLETTMDLAAALREPVP PTVPEP AAVP PW 

103 070 AGGCCGGCGGCGCCACGGGCGCCTGGGCCGACGGGACCGGCCGACTGGGCAGCGGCTGCGCCGAGTACGGCTGACCGGACACGGGCGTGCCG 

<APPAVPAQAS PVPRSPLPQASYPQGSVPTG 
103162 AACGG CTG AC CGG AG ACGGG CG TG C CGAACGGCTGAC CGG AC ACCGGGG CGGCCG AG ACGGG CTG ACC CG AG AC CGGG AAG ACG G AAAC CGG 

<FPQGSVPTGFPQGSVPAASVPQGSVPFVSVP 
103254 CGGTGCGGACACCGGCGGCACCGAGACCGGTGGCGGGGTCCAGCCCCGGGCCTCCGGGCTGCTCGGCAGCTGTCGGGGGATGGCCGGCTGCT 

<PASVPPVSVPPPTWGRAEPSSPLQRPIAPQQ 
1033 4 6 GGCCGCTGCTGGCCGGATCGCCGTCGCCGGACGCCCGCCGCTGCGGCAGCGGGTCGCTCGACTGGCCATTCGAGGTGCGGCCGGCGGCGGCA 

<GSSAPDGDGSARRQPLPDSSQGNSTRGAAA 
103 43 8 CCGCCAGCGGTGCCACTGGCCCCGGTCAGGTCCGACCAGGCCGGCATCGACCGCATGGAACCGGTCGACGCGGGCGTGCCGTGGCCGTTGCG 

<G GATGSAGTLDSWAPMSRMSGTSAPTGHGNR
103 530 CGAGGCCGGGTCGAACGACCGGCCGCCCAGGGTTACCTGGTTGCCCGAGTGCCCCGGCCGCTGGGTGGGCGCCGGTGTCGCCGGGTTGTTGC 

<SAPDF S RGGLTVQNGSHG P RQT PAPTAPNNG 
103622 CGAAGGCCGCGAAGGCACCCAGGGCCGGGGGGGCCCCGCCGGGCTGCGAGCTGGTGAGCGAGGCGGGAGGGGCCGGCAGCGCGCCGGGCTGC 

< FAAFAGLAPPAGGPQSSTLSAPPAPLAGPQ 
103 714 TGGAACCGGCCGGAGAGCGCCCGGGGCACCAGCACGGTGGTGGGCAGGGTGACGTCGGCGACGGTCCCCCGGTCGGTGCCGGGCCGCAGCTC 

<Q FRGS LARPVLVTTPLTVDAVTGRDTGPRLE

103 8 06 gaccttgaccccgtgccgggacgccaaccgggcgaccacaaccaggcccatcatccgggagacggccacgtccacctgcggcggcgaggcga 
<vkvghrsalravvvlgmmrsvavdvqppsal 

103 8 98 ggcggtcgttgaggtcgtgtagctgctcggcgctgatgccgatgccccggtcctcgacgtagaggttggcccggtcgccgacccggcgggcc 

<rdnldhlqeasigigrdevylnardgvrra 
103990 tccaccatcacctgcgagtcgggcggcgagaaggcggtcgcgttgtcgaacagctcggcgaccaggtggaccaggtcgttgaccgcgtgcgc 
<e vmvqsd p ps fatandfleavlhvldnvaha 

104 082 gg cg acctcg atgtc acggt cg atc acc c cg aact cg atc cgggtgt ag tgctcg ac ctcgg actgggcgg cccg cagc acgt cg at c agtg 

<aveidrdivgfeirtyhevesqaarlvdila 
104174 ccgccggctcgcgctgcacgcgggtggagtcggccccggcgagcaccagcaggttctcgtcgttgcggcgcatccgggtggccaggtggtcg 

<aperqvrtsdagalvllnednrrmrtalhd 
104266 agctggaacagctcggccagccggtccgggtcctcctcgccgcgctccagccggtcgaggtggccgatcagccggtcgaccaggatctgcga 

<l qflealrdpdeegrelrdlhgilrdvli qs 
104 3 5 8 acgg cggg ccaggttg acg aacatggtcgcgacggaggcgcgcagcg cgg cctgctccgccgccgtccgtacggcctccaggtggaccgcgt 

<RRALNVFMTAVSARLAAQEAATRVAELHVAN 
104450 TGAACGCCTCGGTCACCTGGCCGAACTCGTCCTTGCTGCGCACCGGCAGCGGCTCGGCGATCTGGTTGGCCGCCTGCACCGGGGAGAGCTGG

< FAETVQGFEDKSRVPLPEA IQNAAQVPSLQ 
104542 CTGG AAAACTG CGG ATCGCG C AG C CGGG C AACGG CCTGGGGCAAACCGTACTGGG CG ATG CTG AG CGC ACCCTGGCGC AGGTCGCG C AG CG A 

<S S FQPDRLRAVAQPLGYQAI S LAGQRLDRLS 
104 6 34 GCGGGCCATCGACCGGGCGACCAGGTACGCGAACAGGATGGCCAGCAGCAGCATGCCGAGCAGCAGGCCGGTCTGGAGGAACACCGTGCGCT 

<RAMSRAVLYAFLIALLLMGLLLGTQLFVTRQ 
104 726 GTACGTCGGAGCGGAGCGCGTCGGCCTGCTTGACCACGTTGCCGTCGAGCTTCGCCTCGACCGTACGGATCAGTTTGGCGCTGGCGACCATG 

< VDSRLADAQKVVNGDLKAEVTRILKASAVM 
104 818 GCCGCGTCCCACTGATCCGGCCCGAACGGCGCGTTGGCCATGCTGCCGTTGGTGTTGCCGTCGAGCCAGCCGGTGTAGTTCTGCGCCTCGCG 

<A ADWQDPGFPANAMSGNTNGDLWGTYNQAER 

104 910 CCGGTCGCCGCCGGCGACGGTCTGGTCGTGCAGGTCCGACTCGTCCAGGCTGGCCACCGCCTTGAAGCTTTGCAGTGCCTGCTGCTGGCCGG 

<RDGGAVTQDHLDSEDLSAVAKFSQLAQQQGT 

105 002 TGCCGCTGGCGATGTAGTCGGTGCGCAGGATGGGGGTCAACTCGCGCTGGATCAGCGCCCGGTGCACCACGACCCGGCGGACCGAGAGGTAT 

<GSAIYDTRLI PTLERQILARHVVVRRVSLY 
105 094 TCCTTCTCCCGGGCGACGGCTGCCGCGGCCCGCATCCGGTCGCTCAGGTCGTTGTCACCGGCGAGGTGGGTGGCGGAGTCGCGGATGGACAG 

<E KERAVAAAARMRDSLDNDGALHTASDRI SL 
10518 6 C AGGT CGTTG AT C AGGC C CT CGT ACGC CTGCATGG CGT CG ATGATCTTC AACTTG CCGTTG AAG ACCTGG CTG CGGGTG CCGGGG AGGTCCT 

<LDNILGEYAQMADIIKLKGNFVQSRTGPLDK 
105 278 TCAGGTTCTGGTCGATCCCGTCGAGGAGGCCCTCCAGGCTGCTCGGCAGGCCGTCGACCTCCCCCCGCTGTTGCAGGTACGGCACCTTGTCC 

<L NQDIGDLLGELSSPLGDVEGRQQLYPVKD
1053 70 TGGTCGACCCGGATGTTGACCCGGTTGTACGCCTCCTGGTACTGCGCCTTGGCCTGGTCGCCGCTCGCCCCGAGCAGCAACACCGCGGAGGT 

<QDVRINVRNYAEQYQAKAQDGSAGLLLVAST 
105462 GCGTTCGTCCTGGAGGCTGTTGACCAGGTCGCCCGAGTAGCCCACCAGATTGGCCAGGTCGCCGGAGCGGTTGGCGTTGTTCAGCGTTTCCA
<REDQLSNVLDGSYGVLNALDGSRNANNLTEL 
105554 GGTTGTCGACGAGGCCACTGGTGCCGACGACGACCGTGGCGATGGTCGGCACGATCATGATGAGACCGAGCTTGGACCAGATCGGCATGTC

<NDVLGSTGVVVTAITPVIMILGLKSWI PM 
105645 GCGGAGCCGGCCGACCGGCCGACGCAGTCGCGACAGGAAGGAACCCGCCGTCTTCGGTCGTTTGCTCACGTCACCGCCCTCGCGATCACAGC
105737 GTTCGCGCGTTGCCCCGGGCAACGCTCAGCGACCGACCCGGCGGGTCGGACCTCCGAGATTCCATCACGCCGTGTTCCAAAGAGAAAGCCCA
105829 GGCTGGCCGTCGCCGGAGGTGTGATGAGATGTTGATGCAATTTGCTCGCAATCCGTCCAGCCGGAGTGACTGACAGTAATGGATCACCCC

> M D H P 

105919 CACCGCCTCGTCCTGCTCGCCGGCCCTTCGGGCTCCGGAAAGTCGTACATAGCCCAACAAACCGGGCTTCCTGTTCTTTGTCTGGACGACTT

>HRLVLLAGPSGSGKSYIAQQTGLPVLCLDDF

106011 CTACAAGGATGGTGATGACCCTACGTTACCGCGCCAAAACGGTCTTGTGGACTGGGACTCACCCCAGTCGTGGGACGCCGGGGCGGCCGTGG

>YKDGDDPTLPRQNGLVDWDS PQSWDAGAAV 
106103 AAACGATTGCCCGGCTGGCGCGGGACGGCAAGGCCGAAGTGCCGGTTTATGCGATCGGCGCGGACCGGCGGGTGGCCACCCGGACATTCGAG

>E TIARLARDGKAEVPVYAIGADRRVATRTFE 
106195 GTCGCCGGATCGCCACTTTTCGTCGCCGAAGGGATTTTCGCCGCCGAGATCGTCGAGGAATGCCGACGGCGAGGGCTGCTCGCCGGGGCGTA 

>VAGSPLFVAEGI FAAE IVE ECRRRGLLAGAY 

106287 CGCGCTGCGCCGGCCGCGCGGCACCACCTTTTTCCGGCGGCTCGCCCGCGACCTGGCCGAGCAGCGCAAGGCTCCCGGGATGCTGCTGCGGC

>ALRRPRGTTFFRRLARDLAEQRKAPGMLLR 

106379 GCGGCCTGGCCCTGCTGCGCGCGGAGCCGGCGGTGCTGCGCCGCCAGGCGGGGCTCGGGGCTCACCCGGCCCCGGCCCGCGAGGTGCTGCGC

>R GLALLRAEPAVLRRQAGLGAHPAPAREVLR 
106471 CGGGTGGCCGACCTGCTCGCCGGCCACCCGCACCACCCCTGATCAGCCCAGCAGCTTCCCGTACGCCGGCTTGATCACCTCGTCGATGAT 

>RVADLLAGHPHHP* <*GLLKGYAPKIVEDI I
106561 GGCCAGCCGCTCGTCGAACGGGATGAACGCGCTCTTCATCGCGTTGATGGTGAACCATTGGAGCTCCTTCCAGCCGTAGCCGAAGGCCTCCG

< ALREDFPIFASKMANITFWQLEKWGYGFAEA
106653 CCAGCAGCGCCATCTCCCGGGACATCGAGGTGCCGCTCATCAGCCGGTTGTCGGTGTTCACCGTCACCCGGAACCGCAGATCGCGCAGAAGC 

< LLAMERSMSTGSMLRNDTNVTVRFRLDRLL 

BamHI 
junction marker 

106745 CCGATCGGGTGCTCGGCGATCGACGCCGCCGCGCCGGTCTGCACGTTCGACGACGGGCACAGCTCCAGCGGGATCCGCTTGTCCCGCACGTA
<G I PHEAI SAAAGTQVNSS PC LELPI RKDRVY 

106837 CGCGGCCAGCCGGCCCAGCACGGGCGGGTCGCCGGGGGTGATGTCGTCCACGATGCGCACCCCGTGGCCGAGGCGGTCCGCGCCGCACCACT
<AALRGLVP PDGPT IDDV I RVGHGLRDAGCWQ

106929 GGATGGCCTGCCAGATCGACGGCAGCCCGAACGCCTCGCCGGCGTGAATGGTGAAGTGGAAGTTCTCCCGCTGCAGGTACTCGAAGGCGTCC

<IAQWISPLGFAEGAHITFHFNERQLYEFAD 
107021 AGGTGCCGGGTGGGCGGGAATCCCGCCTCCGCCCCGGCGATGTCGAAGCCCACCACGCCGGCGTCGCGGTGCCGCACCGCCAGTTCGGCGAT 

<L HRTPPFGAEAGAIDF GVVGADRHRVALEAI
107113 CTCCTGCGACCGGGCGGCGTGCCGCATGGCGGTGAGCAGGGTGCCGACCCGGATCGGATGGCCGGCGTCGGCGGCGAGCGCCGCGCCCTCGG 

<EQSRAAHRMATLLTGVR I PHGADAALAAGEA 
107205 CGAACCCGGCGACGACCGCCTCGACCACCTCGTCCAGGGTCAGGTCCCGCTCCAGGTGCTGCTCGGGGGCGAACCGCACCTCGGCGTAGACG 

< FGAVVAEVVEDLTLDRELHQEPAFRVEAYV 

107297 ACCCCGTCGGCGGCCAGGTCCAGCGCGCACTCCTGGGCCACCCGCCGCAGTGCGGGCGCGGTCTGCATGACCGCCACGGTGTGGGCGAACGT

<V GDAALDLACEQAVRRLA PATQMVAVTHAFT 

107389 CTCCAGGTAGCGCTCCAGCGAGCCGGAGTTCGCCGCCGCGACGAACCAGCGGCCGAGCGCTTCCGGGTCGGTGGTGGGCAGCTCGTGGCCGA

<E LYRELSGSNAAAVFWRG LAE PDTT PLEHGV 

107481 CCTCGGCGGCCAGCTCGACGATCGTCGCCGGCCGCAGGCCGCCGTCGAGGTGGTCGTGCAGCAGCGCCTTGGGGACCTTGACGATGTCCTCG

< EAALEVITAPRLGGDLHDH LLAKPVKVI DE 
107573 TATGAGATTGCGACCATGCCCAGACCCTAGTAGCGACCGGCCGGGTCGCCGACGCGACCGGCTGGGAGGATGTCCAGGTGATGGACCCC 

<YSIAVM >MDP 
107662 CGCATCGTCGACCGGCTGCGTTGCCCGGTCTGCGCGGAACCGCTCACCGAGGCCGCCGCCGGGACCACCCGGGCGCTGCGCTGCCCGCGCCG 

>R IVDRLRC PVCAEPLTEAAAGTTRALRC PRR 
107754 GCACAGCTTCGACGTGGCCCGCCAGGGGTACGTCGACCTGCTCGCCGGCCGGGCCCCGCACGTGGGCGACACCGCCGAGATGGTGGCCGCCC 

> HS FDVARQGYVDLLAGRAP HVGDTAEMVAA
107846 GCGCCGACTTCCTCGCCGCCGGGCACTACGACACGCTCTCGGCCGCGCTCGCCGCCGCCGCGCTCGCCGCGCTGAGCCACCCGCCGGAGGCC

>R ADFLAAGHYDTLSAALAAAALAALSHPP EA 

107938 CCCGGAGCGGACGCGTCGGCCGGCAAAGACGGGCAGGATGCCCAAGCCGGGCGGGATGCGTCCGCTGGACATGACGCGTCCGCCGGACAGCC

>PGADASAGKDGQDAQAGRDASAGHDASAGQP 
108030 GGCCGTCGGGACGTACCCGCTGGTGGTGGACGCCGGGGCGGGCACGGGCCGGCACCTCGCCGCGGTGCTGGCGGCGCTGCCCGACGCCGTGG

>AVGTYPLVVDAGAGTGRHLAAVLAALPDAV 
108122 GCCTGGCCCTGGACGTCTCCAAGCCGGCGCTGCGCCGGGCAGCGCGGGCCCACCCGCGCGCGGCGGCGGCGCTCGCCGACACCTGGCGGCGG 

>G LALDVSKPALRRAARAHPRAAAALADTWRR
108214 CTTCCGCTGGCCGACGCCAGCGTCGCCGTGCTGCTCGACGTCTTCGCCCCGCGCAACGGCGCGGAGTTCCGCCGGGTGCTCCACCCGGCCGG 

>LPLADASVAVLLDVFAPRNGAEFRRVLHPAG 

108306 CGCGCTGCTCGTGGTCACCCCCGCCGAGGACCACCTCGCCGAACTGGTCGACTCGCTCGACCTGCTGAAGGTCGACCCCGACAAGGCGGACC

>ALLVVTPAEDHLAELVDSLDLLKVDPDKAD 
108398 GGGTCGCCGGGAGCCTGGCCGGCCACTTCGAGCAGACCGCCGAGAGCGTGCTGCGGGCCCGGCTGGAACTCACCGGCCGGCAGGTGGCCACC

>R VAGSLAGHFEQTAESVLRARLELTGRQVAT 
108490 CTGGTCGGGATGGGACCCAGCGCCTGGCACACCGACCCGGCCACCCTCGCCGCCCGGATCGCCGCGCTACCCGAGCCGGTCCGGGTGACCCT

>LVGMGPSAWHTDPATLAARIAALPEPVRVTL 
108582 CGCGGTACGGCTCGGCGTGTACCGCCCCCGCTGACCGGGGCGCGGCCCGCCGCTCAGGTGGAAAGGTCGACCTCTTCCCAGCCCGGCGGCTC

>AVRLGVYRPR* <*TSLDVEEWGPPE
108674 CTCGTGGTAGGGCCCTCGCAGGACCACCGCCCACTCCAGCGCCCACCGGCGCTGCCCGATCGCGTTGGCGTCGACCAGGCCGGGCGGCGACC 

<EHYPGRLVVAWELAWRRQGIANADVLGPPSR 
108766 GCCCGTCGCGCTCCAGCTCCAGGTACGCCCAGTCGAGGCAGTAGTGCAGGTCGAGCAGGGCCGCCGCGTCGGCCGGGTGCTGGGGCGCGGCC

<GDRELELYAWDLCYHLDLLAAADAPHQPAA 
108858 AGGATGCGGGAGCGCCACTGCTGGAAGCTCTCCCCGCCCGCGATGTGCGGCAGCCGCTCCACCAGCCGCTCGTCGACCGGCAGCGTCGGGTC

<L I RSRWQQFS EGGAIHPLREVLREDVPLTPD 
108950 GAGCTGCTTGGCCAGGCCGAGCACCCAGGCCAGCGAGAACAGCGCGTCGTGGTGCAGCACGAACGACCGGTGGTCGCCCCGGCCGCCCATCA

<LQKALGLVWALS FLADHHLVFS RHDGRGGMV 
109042 CGAACTGCCACTCCGGCGGGGTGACCAGGTCGACCAGGTGGGAGTTGAGCAGCCAGCTCATCGCCGCCTGCGCCGGCATGCCGAAACACCGG

< FQWEPPTVLDVLHSNLLWSMAAQAPMGFCR 
109134 GCCAGGATCACGTGCAGCACGGCGATGCGCGCCTCGATCTCGACGGTCGGCCGCAGCTCGATCTCGTCGCCCGGCTCCCACACCAGGGGAA

<A LIVHLVAIRAEIEVTPRLEIEDGPEWV 
109225 ACTGGCTCGGTGGCAGCGGCAGCCCCAGCCGGGACAGCTCGTCCAGGCTGGCGTCACGGACCTCTCGCGGGTCGGGAGCGGAAACGCGCACG
109317 GCTCAGATCCCTGTCAGTCGCATCGGCTCAGTGCCGGTCGTCCCCCTTGGCCTGGGAGGATAGCGGTTCACGACGAGCGGCACCACGGCGGG
109409 CGGGGGCGGGGGCGGTTCAGCCGATCCGCTCGATGACCAGCGGCTGCGGGGTCGGGGCGGTCGGCGAGATCCGTACCGCCCGGACCGCCTCG
BamHI 

109501 GCCAGCGCCGCCGGGATCC



Figure HAD 